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VOL. XXXI FEBRUARY, 1940 Parts III anp IV 


ON GENERALIZED ANALYSIS OF VARIANCE. (1) 
By P. L. HSU 


1. The Wilks-Lawley hypothesis. Suppose that, as a result of some random 
sampling, we are in possession of p(m,+n) quantities, where n>>p, calculated 
from the observational data. Calling these y;, and (i= 1, 2, ...,p; r=1, 2, ...,m43 
r’=1,2,...,n), we assume that in repeated sampling they follow the probability 
distribution 


l p n Pp 
“i,j= r= i,j= r= 


and that we have no previous knowledge of the values of the 7;,. Our problem is to 
test the hypothesis A): 


% =0 for i=1,2,...,9; r=1,2, ...,%. (Hp) 


For p = 1 the hypothesis is that to which every linear hypothesis* may be 
reduced, and the test amounts to that ordinarily employed in the analysis of 
variance. For n, = 1 the problem calls for Hotelling’s generalized “‘Student’s” 
test.t 

It should be emphasized that the y;,,, z;, and 9;, will not usually correspond to 
the original physical observations and physical constants belonging to the sampled 
populations, but are so derived from them that (1) is true and that A, is equivalent 
to some hypothesis regarding population constants that we want to test. In another 
paper we shall deal with a general class of “linear hypothesis”’ on multivariate 
normal means, showing how all of them can be reduced by a rotation of the sample 
space to the “‘ canonical’ H,. Here we content ourselves with the following example. 

Example. Case of k samples. 2;,, (t=1,2,...,p; v=1,2,...,4; t=1, 2, ...,m,) 
are k samples drawn respectively from k p-variate normal populations all of which 
have the same set of variances and covariances. Let the population means be 
£,, (¢=1,2,...,p; v=l,2,...,&). Let 


l k 
= M = — 
t=1 v=1 


(8, f=1,2, ...,p; val, 2, ...,&). 


* Kolodziezezyk (1935), Tang (1938). 
+ Hotelling (1931), Hsu (1938), Bose and Roy (1938) 
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222 On Generalized Analysis of Variance 
Suppose that we want to test the hypothesis: 
En = = eee = by for += 1,2, 


Then it is known that if we call y,, and z,;, certain linear functions of the x;,, and 9, 
certain linear functions of the &,,, (1) will hold true and the hypothesis under test 
is equivalent to H). Thus reducing the problem, we have 


n,=k-1, n=M-k, 


k n k 
VirYjr = M,(%;,— Xz) = DV Sigs 
r=] v=1 r=] v=1 


k 


(3, 2, ...,p). 
In particular, if, k = 2 then n, = 1, and we may drop the second index of y 
and 7. We have then 
M+ Mz 


YY; — Ziq) (Tj. — 


n Ms 
r= = = 
ae 
(s,j=1,2,...,p). 

The hypothesis in the above example has been studied by Wilks (1932), while 

an attempt to test the general hypothesis H, was made by Lawley (1938). Both 


assumed no prior knowledge of the values of the «;;. We propose to call H, the 
Wilks-Lawley hypothesis. 


2. Case where the «;; are known, generalizations of Mahalanobis’s distance. 
From now on we shall write 


n 
r=1 r=1 


rm Pp 
r= 


(3, j=1, 2, ...,p). 


In order to test H) when the «;; are known, we calculate the likelihood ratio* 
and get 


—2log (likelihood ratio) = a,a,, = S, say 
i,j=1 


and decide to reject H, if the value of S exceeds some fixed constant, chosen so as 
to fix at some desired level the risk of rejecting H, when it is true. 


* For the definition and interpretation of the term see Neyman & Pearson (1928). 
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THEOREM 1. For arbitrary values of the 7;, the distribution of S is that of the sum 
of pn, non-central squares :* 
2Q-ipn, Sipn,—l dS 2)" 
tpn, 
In particular, if H, is true, then ¥ = 0 and S follows the x? distribution with pn, 
degrees of freedom. 


Proof. Let the sets of variables (y;,, ¥pr), for r= 1, 2, ...,n, be subject to 


the same linear transformation such that, calling the new variables (u,,, ug,, ..., tar) 
P 
YirYir = Vins (r= 1,2, My). 


This is possible because the matrix ||«,; || is positive definite. Let 4,, be the same 
linear function of the 7’s as u,, is of the y’s. Then 


pm 
S= (3) 
and the wu;, follow the distribution 
1 P ™ 
exp >> (u;,— II du. (4) 
i=1 r=1 


From (3) and (4) follows the result (2),+ as we have 


Mir = Vir Mir = 
t,j=1 r=] 


r=1 
From (2) we get 6(S) = P+pn,, (5) 
= + 2( pn, + 2) ¥ + pn,( pn, + 2), 
whence = + 2pny. 
If these general results are applied to the above example of k samples, we have 
Pp k 
S= —%,) (%,—2;) = Sy, 
v= 


k => 


say, and the distribution 


(6) 


For k = 2 we have 


mm, 
= Az — 2X; 
Mm, 


— Sea) (Ej — 


* Fisher (1928), p. 669. + Fisher (1928), Tang (1938), pp. 138-9. 
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224 On Generalized Analysis of Variance 
and the distribution 


nao (h+3p)} 
The quantity Y, is equal to 
m,+ Mm, 


> 


where JA is Mahalanobis’s ‘“distance”’* between two populations having the same 
variances and covariances weighted according to m, and m,. The statistic 


which, according to (5), is an unbiassed estimate of 4, is called the D? statistic and 
the distribution (7) has already been dbtained.+ These concepts and formulae are 
completely generalized here by S,, %;, and the formulae (5) and (6) for the case 
k>2. 


3. Tests suggested by Wilks and Lawley: Generalized E?, 1— EH? and — E?)-. 
From now on we shall assume complete ignorance of the values of the «;;. In the 
case p = 1 the following functions are well known: 


E 
1 1 


n n 


2 
r=] r=] 


Wilks, while studying the k-sample case,{ suggested the functions U and W 
respectively as generalizations of H* and 1— E*. In our general case these are 
defined as 


a; 
if n,>p, (8) 


(9) 


Both are ratios of two determinants. The above defini:ion for U based on the idea 
of a generalized H* breaks down if n, <p, as then | a@,;| vanishes identically. This 
difficulty can be met if we define U as the product of the non-identically vanishing 
roots of the determinantal equation | a,;;—0(a;;+,;) | = 0. Ifn,>p, this definition 
evidently coincides with (8). 


* Mahalanobis (1936). Tt Bose (1936). t Wilks (1932). 
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As a generalization of H2(1— £?)-! we take, instead of UW-, the function V 
suggested by Lawley* and defined as 
= b%a;;, (10) 
i,j=1 


where b‘/ is the element (i, j) of the reciprocal matrix ||6;;||-?. 
If we denote by 9,, 9, ...,4,,, where 1, is the smaller of the integers p and n,, 
the non-identically vanishing roots of the cquation 


| a,;—O(a,;+5,;)| = 0, 
then, according to (9) and (10) and the new definition of V, we have 


U = 6, a) 
i=1 
= 1-4), (12) 
6; 
= 21-8; (13) 


W and V are test functions for H, suggested respectively by Wilks and Lawley. 
If we use W (or V), we reject H, if W is smaller (or V is greater) than some pre- 
scribed constant. In the following sections we shall study the distributions of 
U, V and W in repeated sampling. 
4. Case when H, is true. If H) is true, then, putting all the 7,;, = 0 in (1), we get 
the parent distribution 
1 2 
const. x exp) — 5 IT dydz. (14) 
The following theorem has been proved elsewhere.t 
THEOREM 2. Let l, be the smaller, and 1, the larger, of the integers p and n,. Let 
9, Oy, ...,0, be the non-identically vanishing roots of the determinantal equation 
| a;;— +5;;) | = 9, (15) 
arranged in the order of descending magnitude: 1>0,>0,>...>0,>0. Then the 
simultaneous distribution of the @’s, as derived from (14), is 


P}(n—p+l,+i) | 


Kn—p-)) ( \{ 


j i=1 j=i+1 


Therefore the distributions of U, W and V are those of the functions in (11), (12) 
and (13), where the @’s follow the distribution (16). 


* Lawley (1938). + Hsu (1989), Fisher (1939). 
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226 On Generalized Analysis of Variance 
Take the particular case /, = 2. We have then 
U =0,0,, V= 1-0, 1-9,’ 
whence Ui+Wi<l, V=W-\(1-U-W), 


and the distribution (16) becomes 


I'(n- l 


From (17) we easily obtain the joint distribution of U and W: 


I'(n—p+l,+1) 


dU dW. 


Integrating with respect to W ranging from 0 to (1 — U*)?, we get the distribution 
of U: 


(] — 
Similarly, the distribution of W is 


2B(,,n—p+1) 


Again, from (18) we find the distribution of V: 
(1+ 


+1) 4(1+ 7) 


ay} dV. (20) 


Now in the particular case considered where /, = 2 we have either (i) p = 2, 
1, = n, if p<, or (ii) n, = 2,1, = pif n,<p. Hence, from (18), (19) and (20) we 
get the following three pairs of distributions: 
l 
2B(n,—1,n) 
] 
2B(p—1,n-—p+2) 
1 
2B(n,,n—1) 
2B(p,n—p+1) 
I'(ny+n-— 1) 1 


1+ | 2=p<n, 


(1 — 2=p<cn, 


2=n, <p, 


WippadW, 2=n, <p, 


I'(n+1) 


(1+ +9) V (1 — dy, 2= <p. 
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The general expression for the moments of U and W can easily be deduced 
from (16). We have, from (16), 


C(lg,n) I ly, n) Ti ao, 


i=1 i=1 i=1 j=i+l 


whence 6&(U%W%) = re: +42) IT d6; = Cl, 

The case of k samples (see p. 221) of two variables (p= 2) has been studied at 
length by Pearson & Wilks (1933). The hypothesis H, and the functions U and 
W are called in their paper H,, U, and ZL, respectively. It is found there that the 
test based on U, (ie. U) cannot be regarded as an adequate test of H,. We shall 
show in the next section that H, is true if and only if two population constants, 
called A, and A,, both vanish. The disadvantage of U referred to by Pearson and 
Wilks, may be expressed by saying that it is unlikely to be able to detect the 
falsehood of H, if only one of A, and A, vanishes. . 


5. Simplification of the parent distribution function. The matrix ||q;;|| is 


positive definite; hence it can be expressed as CC’,* where C is some non-singular 
real matrix. Write 


Yor Yoo +++ Yen, Zan Yor Neo --- Yen, 


llores = = 

THEOREM 3. Let G be of rankl (hencel < p,l<n,and lis the rank of GG’ = || y;;|\), 
and let Ax, Ag, ..., A, be the non-vanishing roots of the determinantal equation 
| = 0. Let Og, ..., A, be the non-identically vanishing roots of the 
determinantal equation | a;;—O6(a,;+6;;) | = 0. Then the joint distribution of the @’s, 
as derived from (1), depends upon the parameters A,, Ag, ..., A,alone in such a way that 
instead of (1) we may regard the parent distribution as 


Pp Pp 
exp (— )exp|—5 5 but Mdyde, (21) 
i=1 i=1 i=1 


* The accent denotes the transposed matrix. 


| 
| 
J i=1 ig, 2) 
n 
2, 
> 
= 
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Hence, in studying the distribution of any functions of the 6’s, such as U, V 
and W, we may replace (1) by (21). 


Lemma 1 (Hotelling).* Let A and B be any positive definite matrices of orders 
m and n respectively, and C be any real matria of order m xn and rank l. There exist 
two non-singular real matrices, N, and Ng, such that 


N, AN; =I,¢ N,BNi=I, N,CN; =D, 


vAy 
D,, O VAs 


where D= 
| Oo 


and the A; are the non-vanishing roots of the determinantal equation 
| CB- 1C’—AA |=0. 
Allowing both A and B to be unit matrices in Lemma | we get the following: 


Corollary. Let C be any real matrix of rank l. There exist two real orthogonal 
matrices T, and T, such that 
r,cr, =D 
Ir 
where Dy, O = VAg 
oO 


and the A; are the non-vanishing latent roots of the matrix CC’. 


Proof of Theorem 3. We write tr A for the sum of the diagonal elements of any 
square matrix A. It is easily verified that tr(AB) = tr(BA) whenever AB is a 
square matrix. 


The expression inside the bracket in (1) may be written as 
—4¥—}tr(CC’YY’)— } tr (CC’ZZ’) + tr (CC’GY’) 
= (22) 

The A,, as defined in Theorem 3, are the non-vanishing latent roots of the equation 
| GG’—A(C’)*C-| = 0. On pre- and post-multiplying by | C’| and | C| respec- 
tively this becomes |C’GG’C—AI| = 0. Hence the A; are the non-vanishing 
latent roots of the matrix (C’G) (C’G)’. Therefore, by the corollary to Lemma 1, 
there exist two real orthogonal matrices, T, andT,, such that 
D,, O 


r,c’Gr, = D = 


(23) 


* Hotelling (1936), pp. 326-30. 
¢ I and O stand for a unit matrix and a zero matrix respectively. 
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The transformation of variables 


Z=(C’)“T,V, (24) 
Uy, Ug Un, Vy. Vin 
where Ug, Use Uen, Uae Ven 


are the matrices of the new variables, has a constant Jacobian and leaves the 6’s 
invariant, because the equation (15) becomes 


= 0, (1s) 
where a3; Ujes bi; = Var (2, j= 2, p). 
= rT 


To show this, we have, remembering the orthogonality of I,, 
|6;; || = ZZ’ = | VVT,C. 

Hence the equation (15) is transformed into 

| UU’ —@(UU’ + VV’)| = 0. 


which is another way of writing (15’). 


On the other hand, substituting (24) into (22) and remembering (23) we get 
the expression 


2i=1 ima * 

Replacing again the letters u and v by y and z respectively, we obtain the result. 
It may be noticed that 
Y = tr(CC’GG’) = tr(C’GG'C) = A,;. (25) 
i=1 

Remark. For the case of k samples (see Example on p. 221) Fisher (1938) has 
considered the problem of testing for the colinearity or coplanarity of the k 


populations. It can be proved that the hypothesis of colinearity (or coplanarity) 
states that all except one (or two) of the A’s vanish. 


6. Behaviour of V when H, is not necessarily true. The Laplace transform of a 
probability density function p(x) vanishing identically for x <0, viz. the integral 


J 
0 
has the following property. 
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230 On Generalized Analysis of Variance 
Lemna 2. Let p,(x), (n=1,2,3,...), and p(x) be probability density functions 
vanishing identically for x <0, and such that 


lim | = de (26) 


0 


for every a> 0. Then lim p,(u)du = p(u) du. 
0 0 


Proof. The function f,,(x) = p(x) — p,,(x) is summable in (0, 00) and, by (26), 
lim (x)dx = 0 for every a>0. (27) 


The function 9,(z) = f(x) dx 
0 


is an analytic function of z, regular at every z with a positive real part and uniformly 
bounded. By (27) g,(z) tends to a limit whenever z is real and positive. Hence by 
Vitali’s theorem of convergence* g,,(z) tends to a limit uniformly in the half-plane 
on the right of the imaginary axis. This limit is an analytic function regular at 


every z with a positive real part and vanishes whenever z is real and positive; 
therefore it vanishes identically. Hence 


n>oJ 0 


lim (x) dx = da (28) 
for all « > 0 and real t. Let t 
so that Him L,,(2) = L(a). (29) 
It follows from (28) and (29) ee 


whence, by a well-known property of the characteristic function, 


1 
mal. (us du n(u) du, 


whence lim | e-*p,(u)du -{ »(u) du, 
n>oJ0 0 
zx 
whence lim lim (u) du -| p(u) du. 
0 
The order of the above repeated limit can be interchanged because 


0 0 


uniformly in n as a> +0. This completes the proof. 
* Titchmarsh (1932), p. 168. 
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We shall now derive an expression for the Laplace integral &(exp(—aV)). To 
simplify the notation we shall use a single letter, e.g. y to denote a set of variables, 
e.g. all the y,;,, when they figure as arguments of a function. We shall write dy for 
/Tdy and a single integration sign for multiple integrals with respect to the variables 
from — © to 00. We denote the integral &(exp(—aV)) by L(a). 


The following equation can be verified by direct integration, on referring to 
(10) as the definition of V: 


exp (—$a?V) = {fe z, a, t)dt, 


where 


i=l1r= 
r= 


and where the letter i, when not figuring as an index, stands for ,/— 1. Denoting 
by p(y, 2) the coefficient of /7dy dz in (21), we have 


L(}a%) = fa 2,a,t)dydz, (30) 


as the change of the order of the integration is obviously legitimate for every real a. 
By direct substitution it results 


[rw. z) f(y, 2, a, t)dydz = + exp (— f,(a, t) t), (31) 


1 
The integral (32) is readily evaluated and runs 


The integral (33) can be evaluated by using Wilks’s formula for the moments of 
the generalized variance.* The result is 


Sala, t) = K | (35) 
where 6;; = 1 if i = j and 0 otherwise, and where 


K = 


* Wilks (1932). 
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232 On Generalized Analysis of Variance 
From (34), (35), (31) and (30) we obtain 


Pp l 
L( 4a?) = K exp 8,,-ia¥ Viti dt, 
J i=1 i=1 
whence 


i=1 i=1 


It may be observed that the right-hand side of (36) involves no sum of n terms; 
hence it is particularly useful when we wish to make n approach infinity. 
We may calculate the moments of V by means of (36). Thus 


Pp 2 
i=1 


(37) 


2 p 2 
i=1 i / \i=1 


4D 
i+j 
(n—p—1)(n—p—3) (n—p)(n—p—1) (n—p—3) 
ow pn, +2 2(p—1)(m,—1) 
pm +2) 2(p—1)(n,—1) 


38 
(w= 

Remark. Consider again the case of k samples (see Example on p. 221) and 
write V,, and Y,, correspondingly. It is seen from (37) that the statistic 


(M—k—p—1)¥,—p(k—1) 


is an unbiassed estimate of Y,,. We may regard J, as a generalization of V,, which is 
the ‘‘Studentized” D? statistic* but for a constant factor. V, is, of course, also 
identical with Hotelling’s generalized “‘Student’s” ratio except for a constant 
factor. 

We shall next study the behaviour of V as n->0o. As n is now allowed to vary, 
we shall attach to various letters the index n. In particular we write Y, for ¥ 
to emphasize the fact that the value of Y depends also on n, a fact which is not 
brought out by the formal definition of Y. This is because the 7;,, as we explained 


on p. 221, are themselves linear functions of the original population constants 
with coefficients depending upon n. 


* Bose and Roy (1938). 
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It is in general true that, except when A, is true, in which case ¥,,=0, ¥,, is 
either O(1) or O(n) as n->00. That both are possible is seen in the Example on 
p. 221 for k = 2; W, is O(n) or O(1) according as m, = m,—>0o0 or one of m,, m, 
remains fixed while the other tends to infinity. 


THEOREM 4. If Y,, = O(n), then the mean of V is + (|) and the variance 


of V is ol; . Hence the random variable V — W ->0 in probability. 


This is an immediate consequence of (37) and (38). 

If ¥,, = O(1), and if Y,, tends to a limit as n> 00, the limiting distribution of 
nV will be that of S (cf. (2)) with Y replaced by its limit. This is our next theorem. 
We assume now that 

lim = ¥,. (39) 
The case Y,, = 0 (i.e. Hy is true) is covered by (39) on putting Y%, = 0. But (39) 
may also be regarded as covering the case Y,, = O(n) in the sense that the repeated 
sampling is not made from the same population, but from populations whose 
distribution constants vary with n in such a way that (39) holds true. Thus, for 
example, in the case of two samples with m, = m, (see p. 222) we have 


= }(n+2)9, 


where P= — (E31 — 

Here we have Y,, = O(n) if the population constant ®>0. Y,, will be O(1) if we 
consider ® as decreasing to the order n~ as n increases indefinitely. This idea of 
regarding population constants as varying with the sample size can be found in 
the works of Fisher* and Neyman.} 


THeoreM 5. Lf tends toa limit n-> then 


x 
lim Pr{nV <2} =| p(x) de, (40) 
0 
Wh yh 
where p(x) = =, + (41) 


In particular, if Vy = 0 (i.e. Hy is true), the limiting distribution of nV is that of x? 
with pn, degrees of freedom. 


Proof. Write L(x) for the Laplace transform of the probability density 
function of nV, so that 
LY(a) = &(exp(—anV)) = L(nx). 


* Fisher (1928), p. 663. 
+ Neyman (1937), pp. 169--70, (1938), pp. 70-1. 
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If we replace a by na in (36) and subsequently make the change of variables 
t;, = n~'7,,, we get 

—(n, +n) 


exp 8; —t/(2a) VA; dr, 
i=l i=1 
(42) 
where K,, is now written for K and 


We shall now find the limit of L&(a) as n-> 00. We have, using Stirling’s formula, 
lim K,, = (43) 


For every system of fixed values of the 7,;, we have 


1 
whence lim 2 =exp|—= > 8;;}. (44) 
n>o n 2521 


Calling [,, the integral in (42), we have 


n 


—exp(—3> iid) exp —a sie —t,/(2a) PALE dr 
i= i= 


+ fexp|- (+a) 3 su dr 
i=1 i=1 
= A,+B,,. (45) 
The absolute value of the integrand of A,, is less than 
f Pp 
>» sis) 
i=1 


Hence by (44) and dominated convergence, A, 0 as n->00. The value of B,, is 
easily found to be 


B,, = (1+ 2a)?" exp ). 


1+ 2a 
From (42), (43) and (45) we obtain the result 
j (n) ato 
tim (a) = (1+ vexp( (46) 


If p(x) is defined as in (41), then the integral 


p(x) dx 


is identically equal to the right-hand side of (46). Theorem 5 is thus proved on 
remembering Lemma 2. 
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7. Behaviour of W when H, is not necessarily true and n is large. We shall 
establish the following theorem. 


THEOREM 6. If Y,, = O(1), the statistics nV and —n log W tend to be certainly 
identical in the sense that 
lim (nV +nlog W) = 0 in probability. (47) 


Corollary. If ¥,, tends to a limit Py as n->00, —nlog W has the same limiting 
distribution (40) as nV as n->00, 


Remark. Besides the Corollary there is another significance to be attached to 
Theorem 6. Since the test functions V and W are not functionally related (except 
when p = | or n, = 1), there is the question of choosing one of them to be con- 
sistently used in carrying out the actual tests. This can only be decided by a 
comparison of their power* to detect the falsehood of H, when the 7;, do not all 
vanish. While this may be a difficult problem for small samples, Theorem 6 appears 
to have answered the question for large samples. In fact, if n is large, it is almost 
certain that the values of nV and —nlog W calculated from the sample will differ 
very little. That —nlogW and nV tend to have the same power function is 
a consequence of the Corollary. 


Proof of Theorem 6. For every 0 such that 0<@< 1 we have 


6 
Ht log (1-49) < 


Hence, remembering (12) and (13), 


l 6; 2] 6; 2 
0<V+logW< <3(2 i “5) = 3V?, 
whence, for every 7 > 0, 


Pr{|nV +nlog W| >} = Pr{nV + nlog W Pr{5V*> 


because of (38). This establishes (47). 
Proof of the Corollary. Let 
F, (x) = PrinV <x}, G,(x) = Pr{—nlog W <2}. 
Then for every x and 7 > 0 we havet 
0<G,(x)— F(x) < +9) — 9) + Pr{nV +nlog W > 7}. 


* Neyman & Pearson (1936, 1938). 
+ Fréchet (1937), p. 164. 
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Let F(x) be the limit of F(x) as n->00, which is known to exist by virtue of 
Theorem 6. We have then 


0< G,(x)— F(x) <| 
+ F(x+)— F(x—y9)+ Pr{nV +nlog W > 7}. (48) 


Given any ¢>0, choose and fix an 7>0 so small that F(x+)— F(x—y)<e. By 
(47) the rest of the terms in the right-hand side of (48) is smaller than ¢ for all 
sufficiently large n. Hence G,,(x)— F,(x)->0, i.e. G,(x)—> F(x), which completes 
the proof. 


SUMMARY 


The Wilks-Lawley hypothesis concerning population means of multivariate 
normal populations is put in the canonical form H, (§ 1), and, assuming the popula- 
tion variances and covariances known, the test function S is derived together with 
its exact distribution (§2). In the case where the population variances and co- 
variances are unknown, two possible test functions, denoted by V and W are 
considered (§§3 and 4), and their distributions in certain special cases are given. 
In § 5 it is shown that the sample space can be so transformed that all the variables 
are independently distributed and that only a minimum number of unknown 
parameters remain. These parameters are the roots of a certain determinantal 
equation; the hypothesis H,, and the hypotheses of colinearity and coplanarity 
of populations, all specify the values zero for al} or some of these parameters. 
Returning to the test functions V and W, it is shown that as the sample size n 
increases indefinitely the two functions nV and —mnlog W tend to be certainly 
identical ((47), §7), and both of them tend to have the same distribution function 
as S ((40), §6). 
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THE DERIVATION OF THE FIFTH AND SIXTH 
MOMENTS OF THE DISTRIBUTION OF 6, IN 
SAMPLES FROM A NORMAL POPULATION 


By C. T. HSU anp D. N. LAWLEY 
Department of Statistics, University College, London 


1. INTRODUCTION 


THE method of combinatorial analysis was first introduced in 1928 in a paper by 
R. A. Fisher (1929). In this paper Fisher defined new symmetric functions 
ky, ka, ks, ... of the observations for samples of a given size, and gave simple rules 
for determining the cumulants or semi-invariants of their joint sampling dis- 
tribution. This method is especially valuable for the case where the sampled 
population is normal, and in this case its use reduces considerably the labour 
of deriving the higher sampling moments of the distribution of product 
moment statistics. Two further papers appeared later, one a joint paper of 
R. A. Fisher and J. Wishart (1931) in which further rules were given, the other 
by Wishart (1930) in which he described applications of the theory and gave a list 
of higher order formulae for the normal case. By means of these formulae 
E. S. Pearson (1930) was able to derive the first four moments of the sampling 
distribution of the statistics ./b, and 6, for the case when the sampled popula- 
tion was assumed to be normal. 

The object of the present paper is to derive formulae for «(4°) and «(4®) 
(quantities which are defined below) and then to use these to determine the fifth 
and sixth moment coefficients of the distribution of b,. This work is preliminary 
to further investigations regarding the sampling distribution of this expression. 

If x1, 2%, ...,2, be a sample of size n from a given population, then adopting 
Fisher’s definition of the symmetric functions k,, k,, k3, ky, we have 


k, = 


n2 


(n—1)(n—2)""” 


where m, % =~ Sz. 
1 1 


The coefficients k, are chosen so that H(k,) = x,, where x, is the rth cumulant of 


the given population, and the pth cumulant of the distribution of k, is as usual 
denoted by x«(r?). 
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k(r?) is expressible as the sum of a number of terms of order pr consisting of 
powers and products of the coefficients K,, Kg, ..., Kpr, and to each such term there 
corresponds a number of two-way partitions whose coefficients have to be 
evaluated. As in this paper we are, however, assuming the sampled population 
to be normal, the only non-vanishing cumulant is x,, which is equal to a, the 
variance of the distribution. The only term, therefore, which has to be evaluated 
is the one containing a power of ky. 


2. THE DERIVATION OF x(4°*) 

For a full explanation of what follows the reader is referred to the three 
papers by Fisher and Wishart already cited. 

To determine the formula for x(4°) we must find the coefficients of all the two- 
way patterns for the term in x}°. There are altogether five such patterns, which 
are given in Wishart’s paper, pls they are reproduced below. In each pattern it 
will be noted that there are five corners with four arms attached, each corner 
representing a k,, and ten connections between the pairs of arms, each 
connection representing a x. 


A B Cc D E 
Fig. 1. 

For each of these five patterns, we must determine (1) the numerical coefficient, 
(2) the n-coefficient. 

It is not necessary to give in detail all the working required for doing this, but 
we shall give two examples for each process and also a summary of the results. 

(1) The numerical coefficient is obtained by enumerating all the ways in 
which the pattern can be connected up, regarding as a separate entity each corner 
and also each arm attached to that corner. 

(a) Consider first the pattern A. 


There are } x 4! = 12 ways of aanaliinis the cyclic order of the five corners. 


There are six ways of choosing a pair of arms belonging to corner P to connect 
up to Q, and similarly for the other four corners. 


16-2 
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Finally, there are two ways of joining up the double arms between each pair of 
adjacent corners. The numerical coefficient of A is therefore 12 x 6° x 25 = 12 x 12°. 
(6) Consider the pattern D. 


Fig. 3. 


There are five ways of choosing which corner is to form the centre 7’. 

There are three ways of choosing the pairs of corners P, Q and R, S which are 
linked up by double arms, and two ways of connecting up these two pairs. 

The arms of corner 7’ may be arranged in 4! = 24 ways, while those of each 
of the other corners may be arranged in 12 ways. 

Finally, there are two ways of connecting up each of the two double arms. 

Hence the numerical coefficient* of D is 


5 x (3 x 2) x (24 x 124) x 2? = 240 x 12°. 

We may remark that the five normal patterns for x(4°) can all be obtained from 
those of «(4*) by the insertion of a fifth corner. This fact allows us to derive the 
numerical coefficients by a different method, which may be used as a check. This 
method must, however, be employed with care as otherwise it is liable to suggest 
a wrong result. We shall illustrate it by considering again the pattern A. 

The pattern A may be formed from the smaller pattern which has a numerical 
coefficient of 62208, by breaking one of the double arms and inserting a new 


four-way corner. 


Fig. 4. 


The break may be made in four places and the arms of the new corner may 
then be joined up in 24 ways. The numerical coefficient of A would thus at first 
sight appear to be 62208 x 4 x 24 = 24 x 12°. 

It must, however, be remembered that when for instance the break in the 
smaller pattern is made between P and Q, the broken pattern corresponds to two 
arrangements of the unbroken pattern, since there are two ways of joining up 
the double arms between P and Q. Hence the true value of the numerical coeffi- 
cient of A is in fact half the above number, i.e. 12 x 125. 


* There is apparently a mistake in Wishart’s paper, the numerical coefficient of D being given 
there as 120 x 125. This will of course mean that the approximate value of «(4°) found in that paper 
is incorrect. 
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(2) The n-coefficient. 

In a previous paper (Fisher & Wishart, 1931) rules were given for 
deriving the n-coefficients of a given pattern from those of non-vanishing 
patterns of a lower order which are already known. Thus we can derive the 
n-coefficient of A, which is one of the patterns of x(4°), from those of two — 
of the patterns of x(4*). This is illustrated in Fig. 5, the Greek letters below 
each pattern are used to denote the n-coefficient of that pattern. 


Fig. 5. | =| 
| 


Using the rule for adding a new k,, we have 
(—1) n(n +1) 
and we also know that 


n(n +1) n*—8n3 + 21n?—1l4n+4 
(n (n— (n—3)8 


n> 


Hence we obtain the result 


n(n +1) 
This is the n-coefficient of A. 

Similarly for pattern B. 


a= — 11n> + 45n4 — + 70n? — 36n + 8}. 
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Here we have A= 
n 
n(n+ 1) (n?—5n + 2) 
+ 1)? (n? — 5n + 2) 


(n—1)* (n — 2)8 (n— 3)" 
This is the n-coefficient of B. 


The complete results are as follows: 


Numerical n-coefficient 
Pattern coefficient/125 +n(n+1)/{(n—1)* (n—2)4 (n—3)} 
A 12 n®—11n5 +45n*— 85n3+70n®—36n+ 8 
B 80 n®— 9n§+23n!— 7Tn?—28n?+ 12n 
Cc 120 n® —12n°+52n*— 94n?+59n?—30n+ 8 
D 240 n® — 13n5 + 59n4 — 103n? + 48n2—24n+ 8 
E 32 n® — 15n5 + 80n4 — 160n? + 75n? —41n+ 12 


The formula for «(4°) is calculated by finding the sum of products of the 
numerica' coefficients with the n-coefficients. We thus obtain the result 
4x 125 n(n +1) 


{121n® — 1473n5 + 6335n4 
10,675n3 + 4900n2 — 2536n + 840} 


3. THE DERIVATION OF «(4®) 
There are altogether 17 normal! patterns for «(4°), which are shown in Fig. 7. 
All the patterns except the last two (R and 8) may be obtained from those of 
«(4°) by the insertion of a sixth corner, while the patterns R and S may be obtained 
from those of «(4*) by the insertion of a pair of corners. The numerical coefficients 
may as before be calculated in two ways and we shall content ourselves with a 
summary of the results. We shall, however, give two examples of the derivation 

of the n-coefficients. That for the pattern A is shown in Fig. 8. Here we have 
n(n+1) 


] 
B= in—1 {n®— 11n5+ 45n4 — + 70n? — 36n + 8}, 


n n n 
n* 


Hence a 


—310n? + 212n?— 88n + 16}. 


This is the n-coefficient of A. 
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Similarly for the pattern S. 


r Vv 
Fig. 9. 
n— 
+ 1)? 


(n— 2) (n— 3)?" 
Hence the n-coefficient of S is 


a n3(n + 1)8 
(n= (n—2)8 (n— 


The complete results are given in the following table. Here c denotes the numerical 
coefficient divided by &, where k = 214 x 34 x 5 = 6,635,520 and a, is the coefficient 


of n” in the expression for n-coefficient divided by N, where 


n(n +1) 
N= 


Table of coefficients 


Pattern c ag ag as as a, a, ay 
A 27); 1) -14 78 | —220 341 | —310 212 | — 88 | 16 
B 216} 1)| —12 54 | —100 33 96 | — 80 24 0 
Cc 324 | 1 | —15 88 | —254 373 | —279 178 | — 76 | 16 
D 324} 1] —14 70 | —140 53 138 | —116 24 0 
E 162} 1} —15 90 | —266 389 | —267 160 | —176 | 16 
F 648 | 1 | —16 99 | —292 403 | —236 153 | — 64 | 16 
G 216; —16] 104 | —324 449 | —208 110 | — 52 | 16 
H 432} 1] —-13 60 | —106 17 1ll | — 62 24 0 
J 1296 | 1 | —16]} 100 | —300 421 | —236 126 | — 64 | 16 
K 216 | —16] 103 | —320 451 | —220 101 | — 52 | 16 
L -—16 99 | —292 403 | —236 153 | 64 | 16 
M 2592 | 1 | -—17] 111 | —338 451 | —193 101 | — 52] 16 
N 432} 1] -13 60 | —106 17 lll | — 62 24 0 
P 1296 | 1 | —18 | 129 | —442 675 | —342 195 | — 94 | 24 
Q 432} 1] -—19 | 144 | —6521 825 | —385 210 | —107 | 28 
R 72)1/)])-8 18 4] — 47 12 36 0 0 
Ss 14),1)-8 18 4| — 47 12 36 0 0 


tel 
\\ = 
4 
3 
> 
| 


C. T. Hsu D. N. 245 
The formula for x(4°) derived from these results is 


n(n +1) 
(n — 1)° (n— (n—3)° 


— 2,775,248n + 3,759, 853n4 — 1,717,116 + 946,872n? 
— 476,064n + 136,080} x}?. 


= 6,635,520 


{9025n§ — 145,532n’ + 920,610n® 


4. THE MOMENTS OF THE DISTRIBUTION OF 6, 


The formulae of x(4*), «(4°) and «(4*) have been found previously and we will 
give the results: 


24n(n + 1) 
1728n(n + 1) (n?— 5n + 2) 
«(48) = (n = 1) (n— 2)2 (n— 3) xg, 
1) (5Bn*— 428n8+ 1025n*— 474m + 180) 


(n— (n—2)8 (n—3)8 
We may now calculate the moments of the distribution of k, as follows: 

u(4*) = x(4?), 

H(48) = x(4°), 

(48) = (44) + 342(42) 


__-1728n(n + 1) 
(n— 1/8 (n—2)8 (n—3)° 


= K(4°) + 10x(4?) 
124. 4n(n + 1) 
~ 


{n> + 207n4 — 1707n? + 4105n? — 1902n + 720} x8, 


{5n? + 1402n* — 75,870n4 


— 128,205n + 59,000n? — 30,492n + 10,080} «2°, 
= + 15«(44) + 10K2(48) + 15«3( 4?) 
124, 10n(n +1) 
(n—1)° (n—2)°(n—3)° 
+ 29,339,555n® — 88,717,430n5 + 120,300,577n4* — 55,075,788n3 
+ 30,365,028? — 15,250,464n + 4,354,560} 


The quantity 6, is defined to be m,/m3 and, using the definitions of k, and k, given 
in § 1, we see that 


+ 770n® + 278,359n8 — 4,603, 808n7 


_ (n—1)(n—2)(n—3) ky 3(n—1) 
_ (n—2)(n—3)k, 3(n—1) 


bs 
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Thus the average value of b, is 3(n —1)/(n +1) and, for r>1, the rth moment of 
the distribution of b, is given by 


(n —2)(n—3))" 
= 


(n—3)(a—3))’ 


Now Fisher (1930) has shown that 
(n—1)...(n+2r—3) 


p(... 54°34) = p(... 5°4°342-7) Kb, 
and in particular 
4r—3 
= (472-27) (n—.l) (n +2) r ) 
Thus = (n—1)" (n—2y (n—3/ 


(n+ 1)" (n—1)(m+1)...(n+4r—3) 
Hence, finally, we obtain the following results: 


24n(n — 2) (n—3) 


Halbs) = 3) 5)’ 
= _2728n(n-2) (n—3) (n?—5n +2) 
= (n +3) (n +5) (n +7) +9)’ 
(b,) 1728n(n —2) + 2074 — 1707n3 + 4105n* — 1902n +720) 
= (n+ 
124. 4n(n — 2) (n—3) 
17,516n5 + 75,870n* 
+ 1402n$ -- 17,5165 + 75,870n' 
— 128,205n? + 59,000n2— 30,492n + 10,080}, 
4 
= 2) 3) 110.5. 770 + 278,359n° — 4,603,808n? 


(n + (n+ 3) (n+ 5)... (n+ 21) 
+ 29,339,555n® — 88,717,430n5 + 120,380,577n4 — 55,075,788n5 
+ 30,365,028n? — 15,250,464n + 4,354,560}. | 


5. CHEcK with McKay’s RESULTS IN THE CASE n = 4 


Since the formulae which we have obtained are somewhat complicated, it is 
desirable to provide some sort of check on them. This is supplied by using the 
results obtained by McKay (1933), who found in an exact form the distribution 
of b, for n = 4, i.e. for samples of size 4. 
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Putting x = 6,, the distribution function f(x), of x, is given by 
flz) = — 431; when 1<2x<2, 
9x—17 
I~ 
The moments of w = }b,— } about the origin are given by 


where 


1 
2° 
Hence v,(w) => 
= 3.77 21° 
61 
= 
277 
va) = 
79 
= 


46,889 
ve’) = 


The moments about the mean of w are given by 


= 3.52.7 
— 48 —3.2¢ 
= 5*.7.11.13° 
1424 _ 89 
Ma) = 5*.7.11.13.17” 
(w) = = 27.38.37 
MAO) = 53.17.19 65.17.19" 
76,096 28.1189 
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; Thus for the moments of b,, we have 
palbs) = = 
palb,) = = 
: = = Hi 17’ 


238.1189 
= 46 é 
¥ It may be easily verified that the-same results are obtained by putting n = 4 
4 in the formulae given at the end of the preceding section. There is thus reason to 
- believe that these formulae are in fact correct. 
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TESTING THE HOMOGENEITY OF A SET OF VARIANCES 


By H. 0. HARTLEY 


1. [IyTRODUCTION 


WHEN analysing data the experimenter is frequently faced with the necessity 
of testing the homogeneity in a set of estimated variances. When it is desired to 
combine a number of variances to obtain an estimate of the common variance 
it is necessary to apply such a test. Again, if a selected ‘“‘treatment mean square”’ 
is to be compared with an “error mean square’’, a test for homogeneity has 
recently been proposed (Wishart, 1938) as a safeguard against the selection of 
the largest mean square frora a set of random ones. 

For general use in such cases Neyman & Pearson (1931) have suggested a test: 
the L, test. The statistic L, used in this test has been modified by Bartlett (1937) 
and generalized by Welch (1935, 1936). From recent work (Nair, 1938; Bishop 
& Nair, 1939; Pitman, 1939) it would appear that Bartlett’s statistic ~ is the 
best to use, because it is unbiased in the sense defined by Neyman & Pearson 
(1936, 1938). Or more precisely, the L, test in its original form is biased with 
regard to the admissible set of alternatives. 

Some difficulty has been experienced in obtaining the random sampling 
distribution of this statistic which is required for a test. Various approximations 
have been worked out. There are 

(a) Bartlett’s (1937) approximation using the y? distribution. 

(b) P. P. N. Nayer’s (1936) approximation obtained by fitting Pearson-type 
curves to the distribution in the special case where all mean squares are based 
on the same number of degrees of freedom. 


(c) U. 8. Nair’s (1938) expansion of the exact distribution in the special case 
mentioned in (5). 

(d) Recently another paper on the subject has appeared by E. J. G. Pitman 
(1939). In this paper the author transforms the distribution of L, into a multiple 
integral which can be evaluated in special cases (small values of k) by reduction 
to elliptic integrals. 

The accuracy of the approximations (a) and (6) has recently been tested 
(Bishop & Nair, 1939) in the special case in which the expansion (c) is available. 
Bartlett’s findings were confirmed; it was shown that his approximation is valid 
only for moderate or large numbers of degrees of freedom (> 3). We shall also 
show in this paper that even with this restriction for the degrees of freedom the 


approximation is not very accurate if k, the number of mean squares in the set, 
is large. 
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While U. S. Nair’s expansion, although it is very complicated, provides 
a means of working out the exact probability integral in the special case where 
all mean squares are based on the same number of degrees of freedom, there is 
still uncertainty in the general case. P. P. N. Nayer has suggested that the test 
for homogeneity between & mean squares with f, degrees of freedom (¢= 1, 2, ..., &) 
is (under certain conditions) identical with testing the homogeneity between k 
mean squares all of which have f degrees of freedom, where f is the arithmetic mean 
of the f,. We shall show that, although there is some truth in this statement, 
the harmonic mean should be used for f rather than the arithmetic mean. 

Since Bartlett’s approximation does not provide a test of sufficient accuracy 
in all cases, the main difficulty of dealing with the general case has been the 
large number of quantities on which the exact distribution depends: if the k mean 
squares in the set have f, degrees of freedom respectively (¢=1,2,...,4) the 
distribution would depend on k + 1 quantities. We shall now show in this paper 
that (provided f,>2) there is an approximation of sufficient accuracy which 
depends on three quantities only. These three are: 


(i) &, the number of mean squares in the set; 


(ii) a= where = 
| 1 

(ii) 


This makes the distribution amenable to tabulation, so that the test can be re- 
duced to an inspection of a table of 5 % and 1 % points which can easily be carried 
out by the experimenter. 

In the case where mean squares having one degree of freedom occur in the 
set, the distribution is of a more complicated character, but our approximation 
is still fair. 


2. THE FORMAL SOLUTION 


Consider k normal populations with variances o7 (t= 1, 2, ...,4). Let s? be an 
unbiased estimate of o? based on /, degrees of freedom, and let us denote by F 
the total number of degrees of freedom, 


k 
(1) 
Bartlett’s statistic ~ is then given by 
—2logz = Plog — si. (2) 


The equivalence to a special case of the generalized L, statistic (Welch, 1935, 
1936) is expressed by the relation 


PlogL,=2logw, (3) 
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For our test we require the random sampling distribution ¢(;) of the statistic L; 
under the null hypothesis 


i=o*, ¢=1,2,...,&. 
Under these conditions it has recently been shown (Welch, 1936) that the (q—1)th 
sampling moment of L; is given by 
1 


From general principles it may now be inferred that equation (5) is valid for 
all complex q with 


Real Part of q> 1. 


Further, by Mellin’s inversion formula, we obtain from (5) 


k 
= 


x =, r| )| (6) 


where Q (> 1) is an arbitrary positive quantity. 
Introducing as a new statistic 
(7) 
and as a new variable of integration 
A=$+(q-1)/F, 


we obtain for the distribution function of x (y(x) say) 


k 
wee) = (F) 
t=1 


x — — > 
where A is an arbitrary positive quantity. 
Using now Binet’s integral representation of log /’ (Whittaker & Watson, 
1927, p. 249), we may write equation (9) in the form 
A+io 


+t 


T 
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Introducing g(t) = (5 (12) 


we see that g(7) has continuous derivatives of any order for 0<7<0o. We may 
therefore transform the integral (11) by integration by parts, differentiating 
g(7) and integrating the exponential functions. We obtain 


L © d39(r) k 


3 dr? “ (13) 


We now approximate to E(A), and therefore to y(z), by ignoring the last summand 
in equation (13), and write 


4. 


E(A) = TA (14) 


It can be shown that this approximation is sufficient for all practical purposes 
provided 


¢=1,2,...,&. 


Substituting (14) in (10), expanding e® and integrating the single terms 
we obtain* 


i=0 
where the a, are the coefficients of the expansion 
in ascending powers of t. 
From (16) it is obvious that (to the degree of accuracy considered) the distribu- 
tion of x is a weighted sum of x? distributions with degrees of freedom ranging 


between k—1 and oo. We now denote by P(X) the probability integral of x? 
based on j degrees of freedom, i.e. we introduce 


fa\-1 
P(X) = r(3) 2-4 | (18) 
x 
We further denote by P(X) the probability integral of our variate x defined 
in (7), viz. 
P(X) = “weeds (19) 
x 


* We make use of the well-known integral representation of 1/I"(z), viz. 
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From equation (16) we obtain by integration 


P(X) = ( a) (20) 


where the a; are the coefficients of the expansion of 


— axes (21) 


in ascending powers of ¢. 


3. TABULATION OF PERCENTAGE POINTS 


Equation (20) provides a means of calculating tables of the probability 
integral P(X) (or its 5% and 1% points). For the quantities P(X) are given 
by Elderton’s tables of the probability integral of x*, while the coefficients a; 
are readily obtained from the expansion of (21). 

For practical purposes tables of the 5% and 1% points could be prepared. 
These percentage points would depend on three quantities, viz. 


1 
The effect of c, is small, and it would be convenient to make k and c, the respective 
row and column headings of two-way tables of percentage points, and to prepare 


such tables for two or three selected values of c,. It is hoped to prepare such tables 
shortly. 


4. COMPARISON WITH U. S. Natr’s EXPANSION 


It would lead us too far afield if we gave here a complete mathematical proof 
of the accuracy of the approximation (20). It is, however, of interest to check 
the accuracy in a few cases numerically. U. S. Nair’s expansion mentioned 
above will be used for this check. The most stringent test of the accuracy of 
equation (20) is established by choosing the f, small and & large. Numerical 
results have been obtained from U. 8. Nair’s expansion (2) in the case f, = f = 2; 
k = 10. The result of the test is given below. 


Lower percentage points of L, = e~X!¥ 


5% point | 1% point 


(a) Bartlett’s approximation 0-367 0-277 
(c) U.S. Nair’s expansion 0-375 0-288 
(d) Equation (20) 0-378 0-291 


The agreement between U. S. Nair’s expansion (c) and equation (20), (d) is 
satisfactory in this case where the approximation would be expected to be worst. 
For comparison, Bartlett’s approximation (a) is also shown. 
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5. THE RELATION BETWEEN THE SPECIAL CASE f,=f, t=1,2,...,k 
AND THE GENERAL CASE 


P. P. N. Nayer has considered the general case, and has provided some evidence 
for believing that this case can be reduced to the special case provided that 
the f, are not too small and not too dissimilar in value. He has suggested 
using the mean of the f, as a substitute for the common value f. It is easy to see 
from the approximation (20) that there is some truth in Nayer’s conjecture. 
However, it is not correct to use the arithmetic mean. The correct value is given by 


and is approximately equal to the harmonic mean of the f,. If all f,> 4, the general 
case of unequal f, can always be reduced to the special case f, = f, no matter 


how dissimilar the f,. For if in equation (21) we replace c, by ;'gc,, and consider 
the function 


= 


we find that the coefficients of this function when expanded in ascending powers 
of t will be approximations of sufficient accuracy to the coefficients «,; in (20). 
The probability integral of X is therefore determined by the quantities k and c,, 


so that the identity of the general case and the special case is obvious, provided f 
is defined by (22). 


6. SOME REMARKS ON BARTLETT’S APPROXIMATION 
Bartlett (1937) has given an approximation to the distribution of 
—2logy =z. 
He suggests as an approximate test that we enter the table of x? for k— 1 degrees 
of freedom with the statistic 
32(k—1)/c,, 
where ¢, is given by (15). 

It can be shown that this approximation is equivalent to equation (20) 
provided 4c, is small, so that higher order terms in the expansion (20) may be 
ignored. For large or moderate values of 4c,, however, discrepancies may occur 
even if all mean squares are based on moderate or large numbers of degrees of 
freedom. We shall confine ourselves here to demonstrating this with the help 
of a single example, viz. f = 5 and k = 30. While for values of k of this order 
U.S. Nair’s expansion is very complicated, equation (20) yields results which are 
accurate to 3 figures. Below are given the probabilities of exceeding Bartlett's 
5% and 1 % values; they are 

5% level 1% level 
True P(X) 0-047 0-0081 


Thus Bartlett’s approximation has an error of 6 % and 19 % respectively. 
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THE SIMULTANEOUS DISTRIBUTION IN SAMPLES OF 
MEAN AND STANDARD DEVIATION, AND OF MEAN 
AND VARIANCE 


By L. TRUKSA 
The Charles University, Prague 


In this study I propose to give the application of “the conception of the probability 
of passage”’ to the solution of the rather difficult general problem mentioned 
above. From this single example itis possible to deduce that the introduction of 
“a conception of the probability of passage’’ into mathematical statistics would 
at least make the solution of a range of difficult problems considerably easier. 

Let us assume that the class symbol of the statistical element is a continuous 
two-dimensional variable z, y, defined in the region 2, and that the corresponding 
density of the probability of passage from the class x,, y, into the class x, y 
expressed by the symbol p(x, y,; x,y), depends, not only on the variables x, y 
but also on the discontinuous variable ¢, which is the number of operations 
executed on the statistical element. -By operation we shall mean in our case the 
selection of a statistical element from the fundamental universe. 

Let the function p(x,, y,; 7, y) satisfy the following fundamental relationship: 


fede dxdy = 1. (1) 


A further relationship, which will be used, concerns the calculations of the 
continuous two-dimensional probability distribution F,, (x, y), corresponding to 
the number of operations ¢+1, from the distribution F(x,,y,) by means of 
PAX, Yrs XY): 
y) = Yr) Yr; y) dx, (2) 


The problem of the simultaneous distribution of the mean and standard 
deviation of samples in the case in which the fundamental distribution is given 
quite generally by a function f(x), has, so far as I know, occupied the attention 
of only one man; this was A. T. Craig (1932) in his study: ‘“‘The simultaneous 
distribution of mean and standard deviation in small samples.’’ He introduces 
the solution only for samples of a very small number of items, n = 2, 3, 4. 

The use of the conception of the probability of passage enables us to demon- 
strate successively the solution of this problem for samples with an increasing 
number of items. The method used gives us at the same time a solution in a very 
easy manner, and especially clear, if used with a graphical illustration. 
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I 


Let us use % for the mean of a random sample of the symbols 2, x, ... x, taken 
from the continuous one-dimensional universe of density f(x), where we have 


1? 
(3) 


and for the standard deviation s, which, in accordance with the definition of the 
standard deviation, is given by 


z)2 
(3-1) 


By an extension of this sample of size ¢ to the size ¢ + 2, we get a sample, the mean 
of which is 


x ] 
(3-2) 

and the standard deviation S is given by 
1 t+2 x 


The elementary probability of passage p,%,s; X,S) from the sample with = 
as mean and standard deviation s to the sample of mean X and standard deviation 
S equals the probability of the appearance of the values x,,, and 2,,.; thus 


141) = Az, 83 X,S8)dXd8. (4) 


In this expression it is necessary to substitute for the variables 2,,,, 2,,. in 
terms of the variables X, S; the values %, sin substituting being taken as constants. 


From the expressions (3) and (3-2) we obtain, first of all, the following relation- 
ship: 
= X(t+2)— zt. (5) 


A further relationship which we obtain from equations (3-1) and (3-3) is 


t 
(t+ 2) 8? = ~ (x; —X)? + + XP 


t 
= UXT+ (t+ 2) X24 + — X. 
1 


If we now use the relationship 


t 
ts? = > x? — 
1 


we obtain another equation, which is necessary for the determination of the values 
and 2,9, i.e. 


+229 = (t+2) +z). (5-1) 
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258  Simulianeous distribution of mean and standard deviation 
From equations (5) and (5-1) it then further follows that 


/{2[(¢ + 2) S*—ts*] —t(t + 2) zy} = x 52 


where, for the sake of brevity, we introduce the symbol 
a = + 2) S*—ts?] —t(t + 2) (X 

For effecting the substitution in expression (4) it is also necessary to know 
the corresponding determinant of the substitution 
_ (t+2)?8 

Referring to the two different values for each of the symbols z,,, and 2;,. 

we then get the expression for the density of probability of passage p,(Z,s; X, 8) 


in the form 


Let F{Z,s) be the density of the simultaneous distribution of mean % and 
standard deviation s in the random sample of size t. 

The density of the distribution of mean X and standard deviation S in the 
random sample of size t + 2 is obtained by application of the fundamental relation- 
ship (2), and for its value we get the following expression: 


FlX,8) =| X, 8)deds 


For the complete solution of this recurrence relationship, besides the limits of 
integration (which we shall consider later on), we need to know the initial values 
of the function F(X, S), i.e. 


F(X, 8) = f(X), 
8) = po(%,s; X, 8) = 4f(X + 8) 


Let us supplement these values with the following function F,(X, 8) for which 
only one integration is necessary: 


(7-1) 


dz 

3(X —z)}’ 

the limits of integration will be deduced later on in this study. 
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In order to examine the limits of integration, 

(1) Let us first of all assume that the density of the fundamental universe is 
expressed by the function f(x), defined between the limits +00. 

In this case we set only one condition on the values 2,,,, 2,2, that is that they 
must be real. The condition corresponding to this is expressed by the inequality 
2(t + 2) S?— 2ts? — + 2) (X 

If we consider X, S as constants and %, s as variables and we use rectangular 
co-ordinates with axes %, s, these variables are limited to the region given by that 
half of the elli ; 


which lies above the axis Z. According to the choice of the values X, S, the variable 
% varies between the limits +00; the variable s then lies in the range 0, 00. 


Diagram 1 


The centre of the ellipse EZ, lies on the axis of Z at a distance X from the origin 
of the co-ordinates; the semi-axes have the lengths 


t+2 2 


The integration of expression (7) with respect to ¥ must therefore be carried 
out between the limits 


{ 2s? 2S? 2% 
and then with die tos een the limits 
(0. 
t 
In Diagram 1 the surface of integration is shown for the particular values X = 1, 
S=1-5,t=4. 
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2 

The ellipse Z, intersects the % axis in the points X + 8 J 7’ the semi-major 

axis A, being always larger than A,, their ratio 
A 

depending only on the size of the sample. 

If calculation of the value of F,(X, 8), is involved, i.e. if t = 1, the ellipse Ey 
reduces to that part of the Z axis with X + S/2 as end-points. The corresponding 
integration need only be carried out with respect to ¥, and that between the 
limits X+8,/2. 

Formula (7) can be used, not only for the successive calculation of the simul- 
taneous distribution of the mean and standard deviation for an increasing size 


of random samples, but also for thé verification of the given distribution values 
introduced for the arbitrary ¢. 


Thus, for example, it is possible to check the correctness of the expression 


t-1 
= ts? 
F(X, 8) = 


_ tx? 

2 

which corresponds to the normal universe 


fa) 


By application of formula (7) we obtain 


t-1 
t\ 2 /(t+2\! _(t+2) 
Fudd, 8) = (5)* 
+2 
r( 


dz 
st-2ds S22 


{-1 

Ss (t+ 2) (X*4+8*) 
= (5) (= ) e 2c* 

c 


t+2 
+ 
V(t+ 2), t+2 + St 
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As can be seen, the function F(X, 8) satisfies the fundamental recurrence 
relationship (7). 


(2) Let the fundamental universe be defined by a continuous function defined in 
the range 0, 00. 


Under this assumption the values 2,,,, 2,2, must satisfy, apart from the 
condition of their being real, another condition expressed by the inequality 


/{2(t + 2) 2ts? — t(t + 2) (X >0; 
the mean of the ais % cannot then assume negative values, which gives 
E>0. 
The corresponding values of the variables %, s, satisfying the first inequality 
lie outside the ellipse FZ, 


2 


S* = 0. 


The centre of the ellipse is situated on the % axis at a distance X(t + 2)/(t+ 1) 
from the origin, the semi-axes have the lengths 


From the eusiee S x, = ,(t+ 1) it follows that the semi-axis Aj is longer than 


A;. The ellipse #, intersects the % axis at the points 


t+2 
(s 


The ellipse 7, lies inside the ellipse Z, and touches it at the point %,, s,, where 


as long as s>X 
From the expression for the lengths of the semi-axes it follows that Z, is real 
only as long as the condition xX 
> 
v(t+1) 


is satisfied. 


Considering the condition that the mean % cannot attain negative values, we 
obtain the inequality S<X J(t+1). 


The system of the ellipses F, is therefore real for those points with the para- 
meters X, S as co-ordinates, which lie between the straight lines 


S = X J(t+1) 


and that for > 0, S>0. 
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When carrying out the integration indicated in formula (7), it is also necessary 


to distinguish between the cases when the ellipse Z, intersects the negative part 
of the % axis and when it does not, i.e. if 


s<X or s>X 
In Diagram 2, the corresponding regions of the values X, S are shown fort = 8. 


‘ 

s=x 


Diagram 2 


S=K 


fo” x 


The surface of integration is that part of the plane %, s in the first quadrant 
contained between the ellipses EZ, and £,. 


The process of integration is then as follows: 
(a) Let S satisfy the inequality 


In this case only the ellipse EZ, is real. It is therefore necessary to carry out the 


integration with respect to % in between the limits “t+ J (> -5) and with 
t+2 
respect to s between the limits 0, 
In order to illustrate the theory, formula (7) will be applied to find the 


analytical expression of the correlation surface F(X, S) for samples from the 
fundamental universe f(x) =e-*; O<a<a 

over the region in the plane X, S bounded by the inequality mentioned above. 
For t = 2 and 3 the results are given in A. T. Craig’s paper, namely 


FX, S) = 8) = 6/3 
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By application of our formula (7) we obtain 
F,(X, 8) = F(X, S) = 50/5 S8e5X, 
Regarding these expressions let us assume the general solution to be 
= Aye By 


A recurrence relationship for the coefficients H, may be easily found from the 
equation 


t+2 
8) = 28(t+ 2)? | H, eds = 
2 a 
= Sle 2X, 
t+2) 2 
(t—1)¢? 
It is then not difficult to verify that for t>2 
In? 
H,= 27 * t 
(=) 
The function F(X, 
represents a part of the whole distribution given by the integral . | 
x=0Js=0 
thatis,if t= 2 3 4 5 
100% c. 60 c.30% e.13% 


of the whole distribution. 
(6) If S satisfies the inequality 


<8<¥,/ (3) 


the relative position of the ellipses Z, and EZ, is shown in Diagrams 3a,b. The 
integral in formula (7) is equal to the sum of the integrals between the following 
limits: 

(x) According to 2: 


28% 2s?\ t+2 t+2 
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Diagram 3 (a) 


el 


0 


Diagram 3 (b) 
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According to s: ct 
t+2 X? 


t 


(f) According to Z: 
282 282 2s? 
x- X+ /(F-75)- 


According to s: 
t+2 : t+2 
fF 


le Diagram 4 


q 
U 

) 
i 

' 
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(y) According to 7: 


| t+1 
According to s: 
ft+2 X? 
0; 


(c) Finally if X J 3< S<X/(t+1), part of the ellipse Hy falls to the left of 
the s axis, Z, always being to the right of the s axis, the surface of integration of 
the values %, s is made up of two parts, as can be seen in Diagram 4. 


For the purpose of integration it is necessary to divide the integration into 
four parts: a, 2, y, 6. The integral of the expression given in (7) is equal to the sum 
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of the integrals with the following limits for the value %, and for s, Z to be taken 
first of all: 


(a) The limits of %: 4 
_ get +2 t+2 
The limits of s: ‘+3 
(8) The limits of z: 
x 282 t+2 & x) 
The limits of s: 
t t+1 i 
The limits of 
2S? 2s? 282 Qs2 
The limits of s: 


(6) The limits of 
A t(t+1) t+2 t+1)’ 


The limits of s: a): 


If the calculation of function F,(X, 8) is especially required, that means, if 
t = 1, that there is a substantial simplification in the integration. First of all it is 
necessary to carry out the integration only according to %; besides that it is 


sufficient to differentiate between two cases only, according to whether the 
value S satisfies 


xX xX 
or p< v2, 


since obviously the case of t+) <S<X J 3 drops out of consideration. 
If the first of the given inequalities is valid, it is necessary to carry out the 
integration with respect to ¥ between the limits: 


X—S,/2; X + S,/2; in the case of the validity of the second inequality, the 
corresponding limits of integration are: 


0; 
X+8,/2. 


These results agree with the results quoted in the paper by A. T. Craig 
already cited. 
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(3) Let the fundamental universe be defined in the range 0, a. 
The values 2,,;, X;,2, in this case are determined by the inequality 
O< 
the mean of the sample satisfies the inequality 
O0<Z%<a. 

From the upper limit of the values 2;,,, 2,2 it follows that 

t+2 

} 2) S2— 10 42) (X-2)} <a. 


The corresponding values of the variables %, s, which satisfy this inequality, 
lie outside the ellipse E,: 

Its centre lies on the % axis and is at a distance from the origin equal to 

gtt2 a 
t+1 

this distance is always smaller than the distance of the centre of the ellipse Ey 
from the origin. 


The lengths of the semi-axes are 


t(t+1) t+1 V(t+1) 
The ellipse Z, lies inside the ellipse EZ), and touches it at the point where 


as long as S>(a-X) 
If the ellipse F, is to be real, then we must have 
a—X 
+1) 


This inequality leads to the relation 
S<(a—X) 


Finally, it is necessary to consider the condition that the ellipses Z, and EF, 
must not intersect, in the limiting case they touch at the point 
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From this condition there follows the limitation of the permissible values of X, S 
by the hyperbola H af 
t t(t+ 2) 
For carrying out the integration in formula (7) it is necessary to know the 
condition for the ellipse E, to intersect the % axis at a distance A from the origin, i.e. 


The whole process of integration is quite clearly shown in the graphical 
representation of the individual segments of the permissible values X, S in Dia- 
grams 5-7, for t=1,2 and 3. It is convenient to split up the surface of the values 
X, S, for which both the ellipses E, and £, exist, into two parts by a straight line 
A = ja. 


If X < 4a, 
the semi-axis Aj of the ellipse #, is longer than that of the ellipse F,. 
If X > ha, 


the ratio of the lengths of the semi-axes is the reciprocal. 

The description of the limits of integration for %, s is lengthy in tabular form 
and is better given by means of diagrams. 

In the special case, t = 1, the integration is carried out only according to %, 
for the values X, S (see Diagram 5) in the segment: 

(x) Between the limits: X—S/2; X+S,/2. 

(f) Between the limits: 


0; and X+S,/2. 


Mes 
| 
S =(a-X) | 
Diagram 5 
S=XJ2 S=(a—X)/2 S 5) 
+ 
§ 
au 
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(y) Between the limits: 


(6) Between the limits: 
| 0; — and “+ 
3X 
3X2} and 3X +4 /{6S?-3X?}; a. 


/ Diagram 6 


+ 
a 


As a simple illustration let us find the correlation surface F(X, S) of samples 
of ¢ items drawn from the distribution 


1 
f(x) = O<a<a 


over the region bounded by the straight lines 


S= 0, S . 
V(t-1) (¢—1) 


Using the results given by A. T. Craig, 


x 4 _ 
S) a? F(X, 8S) = 
we find, by our formula (7), 
F(X, 8) = 8) = 
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and then by similar reasoning as in example on page 262 we obtain 


This function is independent of the variable X. 

As to statistical theory, the problem of the correlation surface F(X, 8) is 
solved by our formula (7) for any fundamental population f(x), but as to applica- 
tion on a special distribution f(x), I have not overcome all the difficulties of 


S 
S=X j(t+1) S=(a—X)J(t+1) Diagram 7 
a ‘ 
/ ee \ 
G s=x /? s>(a-X) 
a ‘ue 
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integration. Nevertheless, I feel, the approach being a new one, this study may be 
of interest to statisticians and I hope perhaps that some mathematician will see 
how to solve the problems that I have left uncompleted. 


II 


Let us take the density of the simultaneous distribution of a mean % and 
variance s? = u, when the size of the sample is t, as G,(%, uv). By an application of 
the same process as in Part I we get the fundamental recurrence relationship 
for the successive calculation of the values G,(%, u): 
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The corresponding initial values are expressed by the following relationship: 


G,(X, U) = f(X) 
G,(X, U) = + 


For the calculation of the next value @,(X, U) we need to carry out only one 
simple integration: 
| G(X, U) = X 4,{6U — 2u—6(X 
dz 

X — 4%—4,/{6U — 2u—6(X — zx)? = 
The limits of integration for the different ranges of the fundamental universe 
can be deduced by the method given in Part I. At the same time the three ellipses 

E,, E,, E, are replaced by the three parabolae, P,, P, and Py: 
t+2 

2 

2 


t+2),_ (t+2) 
2 2 


) 4t+2)2X+hu= 
+a(t+2) X —a? 


In conclusion I must express my thanks to Professor E. §. Pearson for advice 
and several useful suggestions. 
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CERTAIN PROJECTIVE DEPTH AND BREADTH 
MEASUREMENTS OF THE FACIAL 
SKELETON IN MAN 


By ALETTE SCHREINER, Oslo 


1. DEFINITIONS OF THE MEASUREMENTS 


In their study of the “flatness”’ of the facial skeleton in man T. L. Woo & G. M. 
Morant (1934) lay down no method for direct measurement of the transverse 
flattening of the middle part of the facial skeleton, i.e. of the part made up by the 
malar bones and corpora of the maxillae. In studying a number of crania with 
differently shaped facial skeletons, it occurred to me that the best expression of the 
degree of projection of the middle part of the facial skeleton might be obtained by 
expressing the projective distances of the zygomaxillary and zygotemporal points 
from the most posterior points on the margins of the pyriform aperture as per- 
centages of the lengths of the chords between the corresponding bilateral points. 
It is true that no precise points of general validity for this purpose can be 
indicated on the “‘nasolateral”’ margins. It is also the case that we often find 
asymmetry here, though hardly more than in other parts of the facial skeleton 
on which routine measurements are taken. However, the advantages offered by 
the most posterior points on the margins for subtense measurements appear to 
me to outweigh the disadvantages. It is a factor of some importance that the two 
subtense planes in question are almost coincident and approximately horizontal. 
After having taken some test measurements with a pair of ordinary co-ordinate 
callipers, I came to the conclusion that this instrument was unsuitable for my 
purpose. I therefore decided to undertake a preliminary research with the object 
of testing the value of the method, disregarding the disturbing factor of asymmetry. 
For this purpose I designed, and ordered from P. Hermann of Ziirich, a special 
pair of callipers with two parallel arms, both of which might be moved in directions 
at right angles to the bar to give readings of the distances of the tips from the bar. 
According to my design the tips were to be slightly blunt, and the bar was to be 
only 2 mm. thick with blunt edges on its working-face. Owing to the trouble and 
expense involved in the construction of an instrument of this kind, P. Hermann 
sent for my approval one of a set of twelve pairs of callipers which he had madesome 
years ago to the order of R. Péch, some of which were still in his possession (Fig. 1). 
In most respects the design is similar to my own, but the bar is 3 mm. thick and the 
edges of the working-face are sharp. Furthermore, the scales of the arms are fixed 
in such a manner as to necessitate correction of the readings of the subtenses, 
whereby 4-3 mm. (controlled by the Weights and Measures Standards Office, 
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Oslo) must be added to the readings. In spite of these drawbacks I decided to keep 
the instrument, and; though it is not ideal for the purpose, it can be used effectively. 
I do not know for what kind of measuring it was originally designed. 

For measuring a chord and its subtense I first set both arms of the instrument 
at equal lengths, longer than the subtense, and place the tips in contact with the 
extremities of the transverse chord. I then fasten the screw, draw back the arms, 
place the working-face of the bar lightly but firmly against the nasolateral margins, 
and move the arms until the tips again meet the bilateral points. On removing the 
instrument the distance between the arms and the subtenses are recorded, the 
latter after addition of 4-3 mm. If the readings on the two arms are not equal, as 


4 


4 


Fig. 1. Callipers used to measure the breadths and subtenses. 
(The figures on the scale are centimetres.) 


is most frequently the case, the average of the two values is recorded. For my 
method of measuring, however, I do not regard as suitable skulls with conspicuous 
asymmetry of the facial skeleton. 

With my instrument I also took the measurements which give the “frontal 
index of facial flatness” of Woo & Morant in such a manner that both arms were 
of the same, or ‘practically the same, length. This frequently necessitated re- 
measuring. I do not believe that the length of the subtense obtained by my method 
differs from that arrived at by using a pair of co-ordinate callipers of the usual form. 

My measurements are as follows: 

(i) The chord JOW, inner biorbital breadth, fmo-fmo of Martin (1928). 

(ii) Sub. TOW, the subtense of the nasion from the chord JOW. 

(iti) The chord GB, bimaxillary breadth, zm-zm of Martin. 
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(iv) Sub. GB, the subtense of the nasolateralia (NI-N1), i.e. the line joining the 
most posterior points on the margins of the pyriform aperture, from the chord GB. 

(v) The chord ZB, bizygotemporal breadth, between the two zygotemporalia 
inferioria (ZT of Woo, 1937). 

(vi) Sub. ZB, the subtense of NI-N1 from the chord ZB. 

(vii) FB, the bizygomatic breadth, zy-zy of Martin. 


Before passing on to the indices, I must comment on the fact that the location 
of the zygomaxillary point (zm) gave me a certain amount of trouble. I did not 
always adhere to Martin’s definition of it, which specifies absolutely the lowest 
point on the zygomatico-maxillary suture. Woo & Morant adopt this definition, 
adding that “if the inferior extremity of the suture is a short length lying parallel 
to the horizontal plane, the anterior point on it is the one accepted”. What is the 
position, however, if the length is not quite short, and if it does not lie quite parallel 
to the horizontal plane, as when the margin is rough and irregular? The fact is 
that we are dealing here with a region of the facial skeleton which reveals a par- 
ticularly high degree of variability, due primarily to differences in the mode of 
origin of the anterior part of the masseter muscle. In many cases the origin does 
not reach the maxilla, and where this is so location of the point presents no diffi- 
culty; but in other cases, and probably for some races in the majority of cases and 
particularly in males, the origin continues for different lengths on the lower border 
of the maxilla, and it often shows fairly strong impressions there. This last con- 
dition was frequently found in the case of the Norwegian skulls which I have 
examined. The same is true for the Eskimo and Australian specimens, but it was 
found much less frequently in the case of the skulls of Lapps and those of some other 
races. There appear to be racial differences in this respect. 

In some cases the lowest point on the suture lies rather far back on a broad 
and rugged surface. Such a point would appear to be quite useless for the purpose 
of measuring the projection-of the facial skeleton, and in fact it may not always 
be possible to reach it with the tip of one of the arms of the callipers when the bar 
is placed on the nasolateral margins. In view of this I was compelled to draw up 
a new definition of the zygomaxillary point, viz. the lowest point of the zygo- 
matico-maxillary suture which still lies on the anterior surface of the bones. The 
adoption of this definition may occasionally alter the length of the chord GB by 
a small amount, but this is of little significance in comparison with the change 
which it will sometimes make in the length of the subtense. 

The definition of the zygotemporal point also seems rather uncertain. Woo 
(1937) defines Z7' as ‘‘the lowest point on the zygomatic suture which is still on 
the lateral surface of the arch”. In some cases, however, it is very difficult to say 
where the lateral surface of the arch ends. The section of the arch shows consider- 
able variation in form. Rectangular sections which make possible an absolutely 
precise location of the point are somewhat scarce. More frequently the zygoma 
has a lateral and a latero-inferior surface which are more or less indistinctly separ- 
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ated from each other. In such cases I have located the point to the best of my 
ability on the borderline between the two. In most cases I have found the chord 
to be 2-3 mm. shorter than the greatest possible breadth between the sutures on 
the two sides. Although the question is of much less importance here than it is 
for the zygomaxillary point, a more precise definition is nevertheless desirable. 

Should the method which I suggest for measuring facial projections in relation 
to the nasolateral margins be generally adopted by craniclogists, it will be necessary 
for them to find indisputable definitions to give the exact location of the extremities 
of the two chords in question. 
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Fig. 2. A horizontal section of the facial skeleton illustrating measurements taken. 


The indices used can be divided into two classes, the first (Nos. viii—xi) in- 
volving subtenses and the second (Nos. xii—xvi) being ratios of pairs of the trans- 
verse breadths. They are: 


(viii) SFi, 100 Sub. [OW/IOW = frontal index of facial flatness, 
(ix) SMi, 100 Sub.GB/GB = = maxillary index of facial flatness, 
(x) SZi, 100 Sub. ZB/ZB = xzygotemporal subtense index, 
(xi) SSi, 100 Sub. GB/Sub. ZB 


(xii) GOi, 100 GB/IOW = maxillo-orbital breadth index, 

(xiii) ZOi, 100 ZB/IOW = zygomatico-orbital breadth index, 
(xiv) GZi, 100 GB/ZB = maxillo-zygotemporal breadth index, 
(xv) GFi, 100 GB/FB = maxillo-facial breadth index, 

(xvi) ZFi, 100 ZB/FB = zygomatico-facial breadth index. 


Finally, I have calculated approximately the angle between the vertical plane 
through the zygomaxillary and zygotemporal points on either side and the median 
sagittal plane (see Fig. 2). Assuming that the two planes zm-NI-Nl-zm and 
ZT-NI-NI-ZT are horizontal and coplanar, the fraction (ZB-GB)/2(sub. ZB-suh. 
GB) is the tangent of the “zygomatic angle” (ZZ). 
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A metrical description of the curvature of the inferior margin of the malar 
bone would be of interest, but my attempts to measure it have been unsuccessful. 
A survey of such a feature would appear to be possible only by using complicated 
projective methods. 


2. THE CRANIAL SERIES MEASURED 


My main material consists of 100 male and 100 female Norwegian skulls from 
medieval churchyards in Oslo, and 100 male and 100 female skulls of Lapps ob- 
tained from various cemeteries in the county of Finmark, the majority of these 
being of fairly recent date. Most of the Lapp skulls form part of the material 
already dealt with by K. E. Schreiner (1931-5), but some of them have been 
acquired at a later date by the Anthropological Institute, Oslo. All of these are 
brachycephalic specimens. The Oslo skulls form part of the material which has 
been described by the same author (1939). I have looked through his records and 
have omitted the small number of skulls with a cephalic index greater than 79-9, 
or an upper facial index less than 50-0. 

The series of foreign skulls in the possession of our Institute are all too small, 
and for tne most part of too miscellaneous a nature, to provide results of any 
importance. I have, nevertheless, examined the best of them for the purpose of 
obtaining comparative data. I measured twenty-five male and twenty-five female 
Eskimo skulls. The majority of these. came from Greenland, but two of the male 
and six of the female specimens are from the opposite coast of Labrador. I have 
found no distinct differences between the two local groups. The other series which 
I measured are: 


Ten male and seven female Indian skulls from different parts of America, all 
of which have common features while none shows artificial deformation; 

Eleven male and eleven female Negro skulls from different parts of Africa, all 
being clearly dolichocephalic; 

Thirteen male and nine female native Australian skulls from different parts 
of the continent; 

Twelve male and nine female Maori skulls from a single cave on the North 
Island of New Zealand. 

The Australian and Maori skulls form part of the material dealt with by 
K. Wagner in his great work (1937). I have only included skulls sufficiently com- 
plete to provide all the measurements. 


3. SEXUAL COMPARISONS: 


Table I gives all the means which I have calculated. We will first examine the 
question of differences between the sexes. In Tables I] and B sex ratios expressing 
the female means as percentages of the male are given for all the characters and 
groups, and in the lower sections of the same tables the differences between the 
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means for the two sexes, together with the standard errors of these quantities, are 
provided for the Norwegian, Lapp and Eskimo series. As will be seen from Table 
IIa, all the male means of absolute measurements are greater than the corre- 
sponding female means, except in the case of Sub. GB for the Negro series. Some 
of the differences for the three longest series are not statistically significant. This 
is so for Sub. IOW in the case of the Norwegian and Eskimo series, although the 
same difference for Lapps is markedly significant. This is an unexpected conclusion 
and no great reliance can be placed on it, particularly in view of the fact that 
K. E. Schreiner has found that in general sexual differences are smaller in Lapps 
than in Norwegians. The peculiarity noted is undoubtedly due to the mixed com- 
position of the Lapp material. Our Lapps do not form 4 homogenéous population, 
being mixed at different places in different degrees with Norwegians and Finns 
(Quains). The skulls were collected.in various localities spread over a wide area. 
Several of the component local series are small and the sexes are unequally 
represented in them. I have calculated the sex ratios of this character separately 
for all the local groups and have found fairly large differences. Some of the group 
means (Kautokeino, Karasjok and Kistrand) give low sex ratios, due, presumably, 
to the fact that the male means are too high to be characteristic of pure Lapps. 

The last column of Table IIa shows sexual comparisons of the zygomatic angle. 
In all series except the Negro the female mean is distinctly smaller than the male, 
although the difference is insignificant in the case of the Norwegian series. 

Sexual comparisons of the indices (Table ITs) are of greater interest. In accord- 
ance with the results of Woo & Morant, the frontal index of flatness (S Fi) shows 
no significant differences, although it appears to have a slight tendency to be lower 
in the female sex. This is also true for the maxillary index of flatness (S.Mi), whereas 
the zygotemporal subtense index (SZi) shows a slight tendency to be greater in 
the female sex (cf. the relations of ZZ). The means of the SS index indicate, in 
accordance with those of the SM index, that the maxillary region tends to be 
somewhat flatter in females than in males. The values of the different breadth 
indices show very small sex differences, but nevertheless they show fairly con- 
sistent relationships. In all groups the biorbital breadth is a little larger relative 
to the bimaxillary breadth in females than in males. The three breadth measure- 
ments of the middle part of the facial skeleton (@B, ZB and FB) show a slight 
tendency to be relatively larger anteriorly than posteriorly in female compared 
with male skulls (cf. ZZ). In my material this relation applies particularly to the 
Eskimos. It may be noted that for their larger series Woo & Morant found a dis- 
tinctly lower mean than I have for the bimaxillary breadth (GB) in female Eskimo 
skulls. The sex ratio of this measurement in their material is only 92-7, as against 
95-2 for the biorbital breadth (JOW). 
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4. RACIAL COMPARISONS 


Of the breadth measurements, the internal biorbital (JO W) shows the smallest 
differences between the series and the bizygotemporal (ZB) the largest. If the 
short Negro and Australian series are disregarded. the smallest means of all the 
breadth measurements are found to be the Norwegian. Woo & Morant give a 
slightly lower mean for the biorbital breadth, and a considerably lower mean 
(3-4 mm. less) for the bimaxillary breadth, for their nineteen Norwegian skulls 
than my values for the medieval Oslo series. The values found by these authors 
for male Anglo-Saxon and medieval English skulls are slightly higher for the 
biorbital breadth, and slightly lower for the bimaxillary breadth, than my 
Norwegian values. With regard to the two breadth measurements in question, 
there is little difference between the Oslo and Lapp series, particularly in the 
case of the male skulls. It may be noted that for the 140 male Lapp skulls 
K. E. Schreiner has given a mean bimaxillary breadth of 95-5 mm., which is 
0-8 mm. less than my value for male Lapp skulls and practically the same as my 
value for male Norwegian skulls. This author has also calculated for 121 female 
skulls a mean (91-7) which is slightly lower than mine. The difference between 
our values may be due to some extent to a difference in locating the zygomaxillary 
point. However that may be, the differences between my female means for 
Norwegian and Lapp skulls cannot be considered statistically significant in the 
case of either JOW (0-81 + 0-51) or GB (1-13 + 0-56). The bizygotemporal breadth 
(ZB), however, is significantly greater in the Lapp than in the Oslo skulls. The 
means of all the breadth measurements of the middle facial skeleton are clearly 
greater for Eskimos than for Lapps, and the biorbital breadth shows differences 
of the same sign, though they are much smaller. It is worthy of note that this 
last breadth is greater, or at all events not smaller, in the remarkably narrow- 
headed Eskimos than in the broad-headed Lapps. The bimaxillary breadth is 
probably greater in Eskimos than in any other human race, and among the known 
peoples of the earth only the American Indians appear to approach them in this 
respect. 

Just as the breadth measurements increase from Norwegian to Lapp and 
from Lapp to Eskimo type, so, to a still greater degree, do the subtenses to JOW 
and GB decrease. Consequently the indices derived from these measurements 
show marked differences. My means of Sub. JOW and of the frontal index of 
flatness for male Oslo skulls are higher than those found by Woo & Morant for 
their Norwegian series, and they accord better with the values given by these 
authors for Swedes. On the other hand, their means of the two measurements 
for Eskimos are greater than mine. 

With their exceedingly low means for the index of maxillary flatness (SMz), 
the Eskimo differs greatly from the other groups examined, including the Indian 
which otherwise bear some resemblance to the Eskimo. In fact, I know of no 
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characters more capable of indicating the peculiarity of the Eskimo skull than 
this index taken together with the chief cranial indices. Nevertheless, the 
Eskimos are extreme in nearly all characters, and the question remains of the 
extent to which the index can be considered of value for racial classification in 
general. I can only contribute a little towards the solution of this problem by 
comparing my Norwegian and Lapp skulls. The means of the index of maxillary 
flatness reveal the following differences: ¢ 2-56, 93-43, with standard errors of 
0-40 and 0-38, respective y. The differences are of marked significance. 

The zygotemporal subtense (Sub. ZB) and its index (SZi) show entirely 
different relations. They are associated less with the “flatness” of the facial 
skeleton than with the antero-posterior lengths of the calvaria, and facial skeleton. 
Among the groups which I have examined the Lapp skulls of both sexes have the 
smallest means for the subtense. The Eskimo values are distinctly higher for the 
subtense, but owing to its considerable zygotemporal breadth the type has the 
lower index. The Norwegian skulls have distinctly higher means, both for the 
absolute measurement and for the index, than Lapps and Eskimos, but their 
means are exceeded by those for the prognathous Negro and Australian skulls. 

The index SSi, which gives Sub. GB as a percentage of Sub. ZB, shows the 
highest means for the Oslo skulls and, as a matter of course, very low means 
for the Eskimo skulls. 

Among the indices which relate the different breadth measurements to one 
another, the maxillo-orbital breadth index (@Oz) would appear to be of value for 
racial classification, as it appears to differentiate families of races. Judging 
from the means given by Woo & Morant for the biorbital and bimaxillary breadths 
the index lies below 100—that is to say, the former is greater than the latter 
breadth—in all European, southern Asiatic, the Australian and the majority of 
African populations, while it exceeds 100in the case of eastern Asiatic, the majority 
of American, and a few African and Oceanic populations. As regards my material, 
the Lapps appear to deny their presumed Mongolian origin, since their values for 
this index do not exceed the Norwegian to a significant extent, while the Eskimos 
and Indians have means distinctly above 100. 

The ZO index rises considerably on passing from the Norwegian to the Lapp 
and then to the Eskimo series. As regards the other indices of breadth, I will 
merely refer to the low values for the ratio of the bimaxillary to the facial breadth 
(G Fi) of Lapps, who are not only distinguished by their weak mandibles, but also 
by their weak maxillary bones in contrast to those of Eskimos. 

Finally, something must be said of the zygomatic angle. As the calculation of 
this is based on four different measurements, its value may be influenced by as 
many errors. Moreover, the two subtenses which are used do not lie in exactly 
the same plane. Even granting these defects, the angle, nevertheless, is fairly 
expressive as illustrating a feature which is not measured by the indices. I have 
previously dealt with the sexual differences, and will here confine myself to calling 
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attention to the difference between the means for Lapps and Eskimos. Both 
these types are characterized by broad and flat faces with what are called “high 
cheek bones”’. The low values for Eskimos may appear to be unreliable, but doubts 
as to this will disappear on inspection of the skull of an Eskimo from the base. 
The unusual breadth of the Eskimo face is due partly to the great bimaxillary 
breadth, and partly to the considerable curvature of the malar bones, but the 
posterior parts of these bones are long and the zygomatic arches only protrude 
slightly in a lateral direction. The bimaxillary breadth of the Lapps, on the other 
hand, is distinctly smaller, the malar bones are rather short in a transverse 
direction and the.arches protrude much more laterally. 


5. VARIABILITIES 


In Tables I{[a and B will be found the standard deviations, while Tables 
IVa and B give the coefficients of variation, for the Norwegian, Lapp and Eskimo 
series. The bimaxillary (GB) is the most variable breadth and the bizygotemporal 
(ZB) is the least variable, while the bimaxillary is the most variable subtense— 
judging by coefficients of variation—and the bizygetemporal is the least variable. 
In the case of the subtense indices, also, the index of maxillary flatness (SMi) 
varies most, while the zygotemporal index varies least. These relations are 
undoubtedly due to the great variation in the form of the facial skeleton in the 
region of the zygomaxillary suture, which affects all the indices involving the 
bimaxillary breadth. As a test of sexual and racial differences in variability I am 
restricted to my Oslo and Lapp material. With regard to this question, I am 
bound to admit that, as long as we cannot count upon absolute accuracy in sexing 
skulls, very little emphasis can be laid upon any difference found. In my material 
male variability tends on the whole to be slightly greater than female, both as 
regards absolute measurements and indices, except in the case of Sub. JOW. 

Table V gives the average coefficients of variation in the two male and two 
female series for three breadth measurements ([OW, GB and ZB), for the three 
corresponding subtenses, for the three corresponding subtense indices (Nos. 
viii-x), and for the three corresponding breadth indices (Nos. xii—xiv). Averages 
are also given for all six absolute measurements, for all six indices and, finally, 
for all twelve characters. At the bottom of the table the corresponding averages 
are recorded for the male plus the female constants. It will be observed that most 
of the male averages slightly exceed the female. As regards racial differences, the 
average for all six absolute measurements is greater for Lapps, but the average 
for all six indices is greater for Norwegians. All the differences between the 
averages for the two series are, however, very small. 
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TABLE V 
Average coefficients of variation for different sets of characters 
6 Absolute 
ae 3 Sub- 3 Subt. 3 Br. . All 12 
Series 3 Breadths| tenses indices indices | Measure- | 6 Indices | characters 
ments 
Norwegian 3 4-05 11-33 11-02 4-68 7-69 7-85 7:77 
3-89 1114 10-97 411 7-52 7-54 7-53 
Lapp 3 4-92 11-42 11-27 4-20 8-17 7-74 7-95 
al 3-67 11-68 10-76 3-98 7-68 7:37 7-52 
Norwegian + Lapp 3 4-49 11-38 11-15 4-44 7-93 7-79 7:86 
3-78 11-41 10-87 4-05 7-60 7-45 7-53 
Norwegian $+ 3-97 11-24 11-00 4-40 7-65 7-70 7-65 
Lapp 3+2 4-30 11-55 11-04 4-09 7-93 7-56 7-74 


6. CONCLUSIONS 


The chief result yielded by my study is that measurements of the subtenses 
from the most posterior points on the margins of the pyriform aperture to certain 
other bilateral points would be a valuable addition to the routine technique 
followed in describing series of skulls. Of these measurements the maxillary 
index of facial flatness appears to be the most useful, but the maxillo-orbital 
breadth index also gives comparisons of considerable interest. The adoption of 
these characters in craniological research will necessitate more precise definitions 
of the zygomaxillary and zygotemporal points than those used hitherto. Further- 
more, a new instrument should be designed for measuring projections from the 
nasolateral margins. 
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TRANSPOSITION OF THE VISCERA AND OTHER 
REVERSALS OF SYMMETRY IN MONOZYGOTIC 
TWINS 


By E. A. COCKAYNE, D.M., F.R.C.P. 


THE twins, Eileen and Joan C., were first brought to my notice by Dr Reginald 
Lightwood, who saw them when they were six years old at the Hospital for 
Sick Children, Great Ormond Street, and found that one had dextrocardia and 
the other was normal. He remembered that they were very much alike and 
thought they were monozygotic. With some difficulty I got into touch with 
them, and Dr James Graham, Assistant County Medical Officer, Essex C.C., 
kindly examined the parents and the surviving brothers and sisters and found 
that in all of them the heart was in the normal position and no cardiac abnor- 
mality was present. The other children are: Eric, aged 21; Albert, aged 18; 
Doris, aged 16; Gladys, aged 12; Betty, aged 11; John, aged 9; and Iris, aged 
5 years. One boy died in 1919, aged 3}, of lymphatic leukaemia. The twins, 
born in August 1924, were 13 when examined. The parents are English and are 
not blood relations. No other case of transposition of the viscera is known to 
have occurred in the family: 

The mother says the twins weighed 3 Jb. at birth and that there were two 
afterbirths, i.e. each had a separate placenta. Clinically the heart is on the right 
in Eileen and on the left in Joan, and the size and sounds are normal in both. 
An electrocardiogram of Eileen taken by Dr J. L. Lovibond at the Middlesex 
Hospital shows inversion of all waves in lead 1, but that of Joan is normal (see 
pp. 290, 291). An X-ray of the chest and abdomen and a barium swallow showed 
dextrocardia with the stomach on the right and the liver on the left side in Eileen, 
and the normal position of the viscera in Joan. 

The twins are very much alike in appearance and, though it is possible to 
distinguish one from the other when together, there is very little doubt that they 
are monozygotic. In view of the rarity of such a mirror image condition a 
number of confirmatory observations were carried out. Eileen is right-handed, 
while Joan is left-handed. Their handwriting is very much alike, but is unformed, 
and it is difficult to say how far the resemblance is due to teaching. B, R, andr 
are formed in exactly the same way in both. 

Miss Ida Mann examined them and made the following report: 

Eileen. Visual acuity 6/6 right and Jeft. Hypermetropia in the right eye and 
hypermetropic astigmatism in the left. In both eyes the error is slightly higher 
than in her twin’s. Retinal arterial pattern dissimilar in right and left eyes and 
does not resemble her twin’s; practically no mesodermal pigment, well marked 
lesser circle and no remains of pupillary membrane. The right eye is the master eye. 
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Joan. Visual acuity 6/6 right and left. Low-grade hypermetropic astig- 
matism in both eyes. Axes oblique. Retinal arterial pattern dissimilar in right 
and left eyes and does not resemble her twin’s. Iris pattern the same in both eyes 
and exactly similar to her twin’s, practically no mesodermal pigment, well 
marked lesser circle and no remains of pupillary membrane. The left eye is the 
master eye. 

Miss Mann says that it is usual to find the retinal arterial pattern different 
in the two eyes of the same person and in corresponding or opposite eyes of mono- 
zygotic twins. According to Viggo Eskelund no two persons show an identical iris 
pattern, but a broad classification into types is possible and these types are 
determined genetically. Unfortunately the iris pattern in the twins is a very 
common one. 

Dr Phyllis Kerridge tested their hearing in a sound-proof room and both 
gave very similar graphs, bone conduction being normal and air conduction low 
normal. 

Dr G. M. Morant took the following measurements in millimetres: 


Eileen Joan 
Stature 1429 1437 
Left cubit 396 396 
Right cubit 397 396 
Maximum head length 180 184 
Maximum head breadth 136 136 
Head height (an uncertain measurement) 128 125 
Cephalic index : 75°6 73-9 
Bizygomatic breadth 121 121 
R. L. R. L. 
Maximum length of ear 56 56 55 57 
Length of phalanges with fingers strongly flexed at metacarpophalangeal joints: 
2nd digit 92 92 94 93 
3rd digit 99 101 103 102 
4th digit 93 95 98 98 
5th digit 76 76 75 78 


The girls are about 2} in. below the English average stature for their age and 
social class. The small differences between measurements suggest that they are 
monozygotic. The small difference between head lengths (leading to one between 
the cephalic indices) is probably the most significant. Joan has slightly longer 
digits than Eileen As all the measurements are subject to errors of at least 
1 mm. it is safest to conclude that they do not provide any evidence of asym- 
metry. 


He also took photographs and reported as follows: The photographs are of 
three kinds. 

(I) Profiles. The outlines of the faces of the two girls are remarkably similar, 
their upper lips being unusually short. Eileen’s chin is rather longer and 
straighter than Joan’s. No difference between the ears (not all visible) was noted. 
Both have free lobes. They provide no. evidence of asymmetry. 

(Il) Full-face. The photographs are not truly full-face, rather more of the 
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right than of the left being shown in each case. The outlines are very similar, 
Joan having a rather shorter and more pointed chin than Eileen. The teeth 
show a slight anomaly, which is the same in both girls. The central upper incisors 
are crossed, the left overlapping the right. 

A palmaris longus muscle is present in both twins on both sides. Hair colour 
in both twins matches scale 7. The hair whorl was clockwise in both twins. 

Miss David took fingerprints and made the following report: 

A glance at Waite’s tables shows that radial loops are nearly as common as 
ulnar loops on the forefinger. The remaining fingers are all ulnar loops, which are 
the commonest of all patterns. There is a distinct similarity between the counts 
of these ulnar loops and in the actual pattern. In both twins the actual counts 
approximate to the median value of the distance of ridges in loops. The thumbs 
are interesting and at first sight appear very similar, but there is a difference 
as shown in the counts, and the right thumb of Eileen is like the left thumb of 
Joan, while the left thumb of Eileen is like the right thumb of Joan. The most 
striking thing is that Eileen has a radial loop on the forefinger of the right hand, 
and Joan has a radial Joop on that of her left hand, and Eileen has an ulnar loop 


on the forefinger of the left hand and Joan has an ulnar loop on that of the 
right hand. 


Right hand Left hand 
Joan Eileen Joan Eileen 
T 23C9 21C6 5 C 20 11 C 24 
1 UL 13 RL 5 RL5 UL 13 
2 UL 13 UL 14 UL 12 UL 138 
3 UL 12 UL 12 UL 12 UL 13 
4 UL 13 UL 12 UL 13 UL 10 


UL= Ulnar loop. RL=radial loop. C=composite. 
The numbers are the number of ridges. 


The palmar and plantar patterns are similar, but those of Eileen’s left side are 
more Jike those of Joan’s right side than those of her own right side and vice versa. 

Dr G. L. Taylor finds that the blood groups in both twins are O M N. Both 
are able to taste phenyl-thio-carbamide. The other taste tests were unsatisfactory 
because they were unable to discriminate clearly between sour and bitter. 

At school the twins were in the same class and much alike in mentality. 
Questioned in different rooms with no opportunity of overhearing one another or 
comparing notes each said that her favourite colour was blue, one immediately 
and the other after some hesitation. Neither knew her sister’s favourite colour. 

I think the resemblance in general appearance and the agreement in so many 
independent characters, a number of which are known to be determined genetic- 
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ally, proves that Joan and Eileen are monozygotic. They differ from the majority 
of such twins in the extent to which they are mirror images of one another: 

Eileen with complete transposition of the viscera. 

Joan with normal viscera. 

Eileen right-handed with the right eye the master eye. 

Joan left-handed with the left eye the master eye. 

Eileen with a radial loop with five ridges on the right forefinger. 

Joan with an radial loop with five ridges on the left forefinger. 

Eileen with an ulnar loop with thirteen ridges on the left forefinger. 

Joan with an ulnar loop with thirteen ridges on the right forefinger. 

Eileen with thumb ridges of the right hand closely resembling those of Joan’s 
left hand, and with thumb ridges of the left hand closely resembling those of 
Joan’s right hand. : 

I can find records of only four similar cases in the literature. Baron (1825) 
showed one before the Section of Medicine of the Academy of Medicine, Paris, 
in December 1825. A short report says that the twin with transposed viscera 
was a girl, who died at the age of 8 days. Kuchenmeister states that the twins 
were monozygotic and that one had normal viscera and the other complete 
transposition. Miller (1893) records a case of monozygotic twins, one with situs 
inversus viscerum and the other with situs solitus, but gives neither their age 
nor sex. Dubreuil-Chambardel (1927) records male twins, aged 25, identical in 
general appearance, in weight, height, and in physical characters. Each had 
varices on the lower limbs of the same type and developing in the same way. 
One had a harelip on the left, the other on the right, and one had complete 
transposition of the viscera, while the other was normal in this respect. The 
visceral condition was confirmed by radiological examination. I have been unable 
to refer to the original account of the twins recorded by Araki (1934), but Taku 
Komai, though he gives neither their age nor sex, says they were monozygotic 
and that one had normal viscera and the other complete transposition. Taku 
Komai says that this is the fifth case recorded. Possibly he includes that re- 
corded by Tamm and later by Betschler. This refers to female twins, stillborn at 
the seventh month, one normal, the other oedematous and hydrocephalic, with 
the heart and aorta on the right side, no lung on the right and a small one on the 
left. The other viscera were correctly placed, but the liver was small and the 
kidneys large. There was one placenta with two cords implanted centrally with 
the umbilical veins anastomosing. 

Newman in his Biology of Twins shows that the armadillo, Dasypus novem- 
cinctus, normally produces monozygotic quadruplets. He makes the following 
observations. At first there are two embryos and later a secondary embryo is 
formed to the left of each primary embryo. Asymmetry of the scutes is not 
uncommon and in the quadruplets all grades of mirror image formation are 
found. There may be mirror-imaging between individuals of opposite pairs 
(primary embryos), but this is much less common than imaging between twin 
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partners derived from one half of the egg (a primary embryo and the secondary 
embryo to its left). In general mirror-imaging between “opposites” is evidence 
of a residuum of a primary bilateral symmetry that held sway in the blastocyst 
before polyembryonic fission began. When the primary outgrowths are formed, 
they are the product of the antimeric halves of the first embryo and should 
therefore show mirror image relations. But a partial physiological isolation of 
the two halves permits a certain reorganization or regulation of new symmetry 
relations, which tends more or less completely to destroy the original symmetry, 
yet often leaving a trace of the latter. Finally when each secondary outgrowth 
organizes its own bilateral symmetry, it tends to lose, partially at least, the 
earlier symmetry relations, and to establish its own mirror-imaging of right and 
left halves. In some cases traces of all three symmetry systems appear in a single 
set of foetuses, but it is common to find only two systems interacting. Newman 
says that no case of visceral reversal was found in spite of a careful search 
through a large number of foetuses. This, he thinks, is due to the fact that 
twinning is initiated and carried out in the ectoderm, and that the endoderm 
becomes involved only passively and considerably later. 

In the case of human monozygotic twins there may be almost complete 
mirror image arrangement of palmar patterns so that the left hand of x corre- 
sponds with the right hand of y and vice versa. This corresponds with the rather 
rare mirror-imaging of “opposites’’ seen in armadillo quadruplets and in 
Newman’s opinion goes far to prove that polyembryony actually occurs in man. 
A commoner manifestation in man is reversal of symmetry in the pattern of an 
index finger of one twin so that it mirrors the condition in the corresponding 
index finger of the other twin. This reversal of pattern occurs in both hands of 
Eileen and Joan. 

In addition to the ridges on the skin other epidermal structures may show a 
mirror image arrangement, such as naevi and angiomata, and Siemens (1924) 
records an accessory nipple below the normal one, which was on the right side 
in one and on the left side in the other twin. 

Newman believes that monozygotic human twins become physiologically 
isolated at a considerably earlier period than do armadillo quadruplets and 
founds his belief on the fact that there is so little mirror-imaging in the former 
and so much in the latter. He says that as a general rule the earlier the separa- 
tion the more complete is the reorganization of the symmetry relations in the 
separate individuals and the less residuum of the original common symmetry. 

In double monsters of moriozygotic origin there is, according to Dubreuil- 
Chambardel (1927), frequently a hare-lip, which is axial in both or lateral in 
both, and as a rule it is of the same degree, and the hare-lip of one component is 
always a mirror image of the hare-lip of the other. He points out as many others 
have done how often one component of a double monster has complete or partial 
transposition of the viscera, the arrangement being that the apices of the two 
hearts point away from the place where the twins are attached to one another. 
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There can be no doubt that neither the hare-lip nor the transposition of viscera 
in double monsters is of genetic origin. Hare-lip in man is sometimes determined 
genetically, but the incidence in double monsters is far too high for this to be 
true in their case. Newman says that in the case of double monsters fission must 
begin much later than usual and is never completed. While in the armadillo 
polyembryony is the rule and fission is standardized, in man it is the exception 
and there is considerable variability in the period at which fission takes place. 
Transposition of the viscera in man is recessive to the normal arrangement, and 
elsewhere I have suggested (1938) two ways in which it could affect only one 
member of a pair of monozygotic twins. Apart from the improbability of so un- 
usual an occurrence happening in at least five recorded cases, neither explanation 
accounts for the occurrence of other mirror image arrangements, and any genetic 
explanation must be abandoned. If Newman’s views about mirror image forma- 
tion are accepted, the very rare cases of human monozygotic twins with a 
mirror image arrangement of the viscera in addition to that of ectodermal 
structures are due to an unusually late fission. The fact that they are so rare 
seems to show that unless fission is very late each twin reorganizes its own 
symmetry and, if genetically normal, the viscera are normal in both, but if the 
zygote is a homozygous recessive for transposition of the viscera, the viscera are 
transposed in both. Apparently few twins in which late fission occurs succeed in 
becoming completely separate, for double monsters with their own mirror image 
symmetry, i.e. with the viscera transposed in one and normal in the other, are 
far commoner than separate twins with this arrangement. The separate twins, 
which do show this arrangement have in fact just escaped being double monsters. 


My thanks are due to Prof. R. A. Fisher for kindly putting the resources of the 
Galton Laboratory at my service and to Dr Julia Bell for her help and advice, 
and to Miss David, Dr Taylor and Dr Morant. I wish also to express my gratitude 
to Miss Ida Mann and to Dr Lovibond. Dr Graham had intended to write a 
report on the twins, and I should like to thank him for allowing me to do so 
instead and for examining the other members of the family. 
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THE HUMAN REMAINS OF THE IRON AGE AND OTHER 
PERIODS FROM MAIDEN CASTLE, DORSET 


By C. N. GOODMAN anp G. M. MORANT 


1. INTRODUCTION 


ExcavaTIons at Maiden Castle, the largest fortified site of prehistoric date in 
England, were conducted under the direction of Dr R. E. M. Wheeler during 
four consecutive seasons from 1934 to 1''37. The discoveries made were of unusual 
archaeological interest, and they include a number of skeletal remains which 
were preserved with unusual care. The bones available represent 104 individuals 
ranging in age from a foetal to a senile stage of development. This paper is a 
report on the eighty-three skeletons which are sufficiently well preserved to be 
of some anthropological interest, and the periods, sexes and age groups to which 
they are assigned are given in Table 1. An unpublished report on the excavations 
by Dr Wheeler contains particulars of each of the 104 individuals, and a section 
by the present writers in the same work gives a summary of the general con- 
clusions reached below and fuller descriptions of the mutilated specimens. 


TABLE I 


The periods, sexes and ages of individuals from Maiden Castle whose 
skeletons were measured 


Period or group Adult Adult ¢ Infant | Totals 
Neolithic 3 1 4 
Iron Age A 1 — a — 4 5 
Iron Age B 2 7 1 —_ 3 13 
Iron Age 8 8* 17 
Belgic 1 1 2 
Belgic War Cemetery 20 9 — 1 1 31 
Romano-British 3 3 6 
Iron Age or Romano-British 3 1 — — — 4 

Totals 41 30 1 2 9 83 


* Including one individual (T 28) classed as Iron Age C or Romano-British. | 


The metrical study of the skeletons made hitherto has been confined to the 
crania and mandibles, on which all the customary measurements have been 
taken, and to the clavicles and long bones of the arms and legs, described by 
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lengths only. Individual measurements and remarks for the crania and long 
bones are given in the appended Tables VII and VIII; remarks on the mandibles 
are given with those on the crania, but the individual measurements of these 
bones are not provided. Contours of the crania have been drawn, but they are 
not used here, as additional English material of the same periods would be 
needed to furnish sufficiently reliable type figures. The writers hope to undertake 
a more detailed examination of the long bones and an examination of the other 
bones of the skeleton. 


TABLE II 
Maximum lengths of the shafts of long bones of infants of Iron Age 


date and of modern foetuses 
No. |GML| QU| P1|RU|T19] O | P35 
Year excavated ... | 1936 | 1937 | 1937 | 1937 | 1937 | 1937 | 1937 | 1936 | 1937 — _— 
Period* ae ee A A A B B B Ct | Ct | Modern$} Modern|| 
Femora R. — | 71) — | 7329| — _ — | 849) 65-1 58-0 
L. — | %H1| — — | 85-2 | 76-9 | 72-4) 815) — 65-0 58-0 
Tibiae R. — | 640] — | 68:2) — | 67:0 | 68-2) — 56-1 51-0 
L. — | 64-2] — | 688) — | 67-1 | 688) — | 72-1 56-5 51-1 
Fibulae R. — | 610) — —_ — | — | 61-2 | 680] — = 48-2 
L. — | 61-2) — eee See — | 612) — —_— 53-0 48-2 
Humeri R. 70-5 | 64-1 | — | 66-0 | 74-0 | 67-0 | 66-3 | 71-4 | 73-0 58-1 53-1 
L. 70-7 | 64-8 | 75-5 | 67-38 | — —_ — | 71-4 | 73-0 58-2 53-0 
Radii R. — | 49-0 | — | 54-4 | 58-2 | 52-7 | 51-8 | 55-5) — 45-0 44-0 
L. — | 49:1] — | 54-4] 58-0 | — | 51-2 | 55-7 | 57-9 45-0 44:0 
Ulnae R. 64-4 | 58-2 | — | — | 66-2] 61-1] — 53-0 49-0 
L. — | 580; — 61-4 | — — | 588 | 645) — 53-0 49-0 
Clavicles R. — | 460) — — | 483 | — | 42-1 | 49-1 | 46-4 39-2 34:5 
L. 47-0 | 460 | — — | 484) — | 426) — — 38-8 34-0 
* A, Band C refer to the chronological divisions of the Iron Age. + Belgic. 
+ Belgic War Cemetery. § Full term. || 7 months foetus. 


Nearly all the skeletons are incomplete, but in the majority of cases the sexes 
of the adult and adolescent individuals can be judged, principally from the 
pelvis, with a reasonable assurance of accuracy. It was suspected at first that 
nine skeletons, which are remarkably well preserved in view of their tender age, 
were foetal. The maximum lengths of the shafts of the long bones for these—the 
measurements having been taken in any direction by using the flat arms of a 
pair of small calipers—are given in Table II. The last two columns on the right 
of the table give the same lengths for a child at birth and a seven months foetus. 
These two specimens are preserved in the museum of -the Department of 
Anatomy, University College, London, and we are indebted to Dr Matthew 
Young for giving us access to them. All nine of the skeletons from Maiden Castle 
are appreciably larger than the two of modern date, and hence it must be 
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supposed that the former are of young infants, probably all from three to six 
months old. No other measurements of them were taken. 

The remnaining seventy-four skeletons are available for racial comparisons, 
and the majority of these are of Iron Age date. Thirty represent the population 
of the site at the very end of the period, as they came from a cemetery used for 
the Belgic defenders of the Castle who were slain by Roman invaders in the year 
A.D. 43. Nearly half of the skulls in this series bear witness to the event in the 
form of sword-cuts and other mutilations. The Belgic War Cemetery series is 
just long enough to make statistical comparisons between it and other groups 
worth while, but all the other groups distinguished in Table I are clearly too 
small to stand alone. In some comparisons below a composite series made up by 
pooling all the other individuals of Iron Age or Romano-Sritish date is con- 
sidered, either apart from the Belgic War Cemetery series or combined with it. 
This procedure is of provisional value only, of course. There may have been 
changes in the racial composition of the population of Maiden Castle during the 
Iron Age and again in Roman times, and the evidence available is quite in- 
sufficient to show whether this was so or not. 

It should be appreciated, too, that it is not unlikely that a small community, 
which may have been peculiar owing to relative inbreeding, may have persisted 
there for several generations, and the characteristics of such a local group may 
mislead if it is taken to typify the large racial population of which it formed a 
special part. The new material from Maiden Castle provides a very welcome 
addition to our meagre knowledge of the physical characters of the inhabitants 
of England in Iron Age times. It may be hoped that it will form a nucleus to 
which other specimens of the period—both those already housed in museums 
and others as yet undiscovered—may be added, until the evidence is abundant 
enough to satisfy the most exacting anthropologist. The object of this paper is to 
provide a descriptive record of the skeletons available with such a hope in view, 


and any results regarding racial relationships made in it are intended to be of 
a tentative nature only. 


2. NON-METRICAL FEATURES OF THE MAIDEN CASTLE SERIES 


The remarks on the Iron Age and Romano-British series given in this section 
relate almost entirely to the skulls. Comments on a few of the long bones which 
exhibit gross abnormalities are given in Table VIII, and the remaining parts of 
the skeletons have not been examined. The Neolithic skeleton of a young man 
(Q 1) was extensively mutilated, and otherwise it is not remarkable. The skull- 
cap of the same period is also that of a young man. The Saxon skeleton (Q) is 
male but the age at death cannot be estimated as the skull is missing. The 
following remarks refer to the remaining specimens, which are all of Iron Age 
or Romano-British date. 
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Estimates of the ages at death of the adults were obtained by noting the 
conditions of the coronal, sagittal and lambdoid sutures, and the ectocranial 
aspects of these give the following frequencies: 


Sutures begin- 
All ges ning to close or All — Totals 

partly closed 
d Belgic War Cemetery series 5 (28%) 12 (67%) 1(6% 18 
Others 3 (21%) 9 (64%) 2 (14%) 14 
Total series 8 (25%) 21 (66%) 3 (9%) 32 
g Belgic War Cemetery scries 3 (43%) 4 (57°%) —_ 7 
Others 1 (37% 10 (53%) 2 (11°) 19 
Total series 10 (38%) 14 (54°%) 2 (8%) 26 


It is probable that all the people buried in the Belgic War Cemetery were 
massacred. The age constitutions of the short adult series are very similar, 
however, to those of the series made up by other people interred at Maiden 
Castle, who probably died from natural causes. The massacre must have been 
carried out without regard to age, though there appears to have been some sex 
discrimination, since twice as many men as women were excavated from the 
cemetery (see Table J). The percentage frequencies for the total series show the 
usual sex differences, due to the fact that the sutures tend to close at an earlier 
age in males than in females. Comparison with values obtained in the same way 
for other series (see Risdon, 1939, p. 107) shows that the Iron Age men died at 
a rather younger age, on the average, than those interred in English cemeteries 
of a later date, but that the women are not distinguished in the same way. In 
considering the frequencies of anomalies, the total series from Maiden Castle is 
referred to below. 

The sagittal suture was normally the first to close, followed by the coronal 
and then the lambdoid. Only one specimen (P 23, male) is definitely anomalous 
with regard to the condition of the principal calvarial sutures. Its sagittal 
suture is obliterated, the coronal is beginning to close and the lambdoid is open. 
This specimen shows no apparent sign of distortion and its cephalic index (77-4) 
is above the average, so it may be assumed that the sagittal suture was not 
obliterated before maturity was reached. Of thirty-three male frontal bones 
only one (P ii, an isolated bone for which no measurements are given) is metopic, 
and there is only one female metopic specimen out of a total of twenty-seven 
frontal bones. Among Western European crania metopic specimens are usually 
found with a frequency of about 10°%, but the Iron Age series are so short that 
no significance can be attached to the fact that the frequencies for them are 
exceptionally low. There are no examples of interparietal bones, or of an os 
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épactal, among the thirty-two male and twenty-six female occipital bones. A 
much rarer anomaly is exhibited by a male cranium (P 7A). This is a trace of 
a suture (33 mm. long) extending across the posterior part of the right parietal 
bone: its position can be seen from Plate ITB. One male skull (P 27) shows the 
occipito-mastoid suture obliterated prematurely on both sides. There is one case 
of fronto-temporal articulation on both sides among the males, and one of the 
pterion in K on both sides among the females. One female specimen has both 
malar bones divided by horizontal sutures. A male skull (O 3, Plate IIIB) has 
the basi-occipital broken exposing the extension into it of the sphenoidal air 
sinuses, the condition exhibited by the Swanscombe skull. There is one large 
cavity and the extremity of a small one to the right of it. 

Wounds that had healed completely during life were noted on the cranial 
vaults of three of the male and two of the female skulls. None of these was 
severe and a female specimen (T 22) has a depression on the right temple which 
was more serious than the other injuries. An adolescent female skull (P 20, 
Plate IT A) has the right malar bone deformed, probably as the result of a wound. 
By far the most serious injuries of traumatic origin are shown by the long bones 
of one of the males (T 5). His right elbow was shattered, involving complete 
separation of the proximal extremity of the ulna, his left radius was fractured 
near the wrist, and his fractured left fibula became fused to the tibia. Healed 
fractures of the long bones were only found in the case of two other skeletons, 
viz. P 23 (male, right ulna) and P 36 (female, left fibula). Pathological conditions 
of bones not of traumatic or dental origin were only noted in the case of one male 
skull (T 9) which has a swelling on the right maxilla below the orbit which was 
probably caused by a tumour, of a skeleton of the same sex (P 25) which has 
osseous deposits on the muscular ridges of the left femur and fibula, and of a 
female (T 12) which has the left humerus markedly deformed, probably as the 
result of an inflammatory condition of the surrounding soft tissues. We are 
indebted to Dr A. M. El Batrawi for help in interpreting the conditions of these 
and other specimens in the series. 

The teeth of the individuals interred at Maiden Castle are by no means well 
preserved. The following frequencies are found for the total “Iron Age” series: 


Upper jaw Lower jaw 

3 3 2 

No teeth lost before death 12 8 22 13 
One or more teeth lost before death 16 15 10 12 


In the case of excavated series of skulls which are not of recent date, it is 
customary to find for either sex that more than half the specimens had lost no 


| 
= 


300 Human remains of the Iron Age and other periods 


teeth from the upper jaw before death. Abscess cavities were found in several 
of the jaws. A female specimen (T 21, Plate III A) has two symmetrically dis- 
posed cyst cavities in the buccal surfaces of the maxillae at the roots of the 
second premolars. There are examples of crowded front teeth. A male mandible 
(P 23) has a milk molar retained in the position of the left second premolar, 
which had not appeared, and a female bone (P 26) appears to have had only three 
incisors erupted. The upper jaw of a female skull (T 28, Plate III D) shows the 
canines missing and the first premolars rotated. A female mandible (P 14, 
Plate III E) has the left third molar impacted but almost fully erupted: there are 
also distinct swellings on the inner alveolar margin extending from the first 
premolar to the first molar on the left side, and from the second premolar to the 
first molar on the right. Several of the Sinanthropus mandibles exhibit this 
condition (torus mandibularis), and it occurs more frequently, and to a more 
marked extent, in some modern races of man (particularly in Eskimos), but is 
seldom found in European series. Dental anomalies appear to be exceptionally 
frequent in the short series of skulls from Maiden Castle. 

In general appearance the specimens show considerable individual differences, 
but nearly all might be considered quite unexceptional if found in any British ~ 
collection of later date than the Bronze Age. There are few examples, however, 
of the markedly retreating frontal bones which characterize the seventeenth- 
century London skulls, particularly those from Farringdon Street. The variation 
exhibited appears to be no greater than that expected for a community of inter- 
marrying people, except for the fact that one female skull from the Belgic War 
Cemetery (P 36, Piate ITC, D) stands apart from the others on account of its 
aberrant form. The facial skeleton has an unusual premaxillary height, though 
it is not prognathous; the nasal index is high but not extreme, and the nasal 
bridge is broad and depressed. The cephalic index of this specimen (87-0) is 
easily the highest for the series, and the individual was decidedly short 
(1451 mm. = 4 ft. 9} in.), though taller than two other women interred in the 
Belgic War Cemetery. It is possible that the skeleton P 36 is that of an alien 
in Western Europe, but it appears to be more probable that its peculiarities are 
of individual rather than racial significance. Its measurements were included in 
computing averages. 


3. METRICAL COMPARISONS OF THE IRON AGE AND ROMANO- 
BRITISH CRANIA 


Excluding the two Neolithic specimens, there are 30 adult male and 26 adult 
female crania sufficiently complete to give measurements, though most of these 
are defective to some extent. Individual readings are in the appended Table VII. 
In spite of the small numbers, it appeared worth while computing separate means 
of the more important characters for (a) the Belgic War Cemetery series, and 
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(6) all the other specimens of Iron Age or Roman date. The latter is a miscel- | 
laneous collection made up by individuals belonging to the following groups: f. 


Male | Female Male | Female 
Iron Age A 1 — Romano-British 3 3 
5 B 1 7 Iron Age or Romano-British 2 1 
Cc 6 8 


The pooling of this material, and treatment of it as if all the specimens 
represented a single racially homogeneous population, is obviously a pis aller. 
The means found for the composite group (Table III) are actually very close to 
those found for the Belgic War Cemetery series, which is almost certainly made 
up by individuals who belonged to a single community. On the supposition that 
the variabilities of the groups were of the usual order, all the differences between 
the two sets of means are quite insignificant. Both types have surprisingly large 
basio-bregmatic heights, and hence unusual indices (100 H’/Z and 100 B/H’) 
involving this diameter. Otherwise, they appear to have no features which can 
} be considered at all peculiar in English series of post-Bronze Age date. 

This comparison provides some justification for combining the two sub- 
groups, to give a single series which may be supposed to represent the population 
of Maiden Castle from Iron Age A to Roman times, though it is clear that the 
evidence available is quite inadequate to demonstrate that the racial constitution 
of the population remained stable throughout the period. The following distri- 
butions are found for the total sample. 


Cephalic index i 
(central values) 67 69 | 71 wo | %. } TT 79 | 81 83 | 85 | 87 |Totals 
3 1 1 2 5 6 7 1 |] 2 
2 4 9 4 2 1 | 2 
Height-length 
index 68-5 | 69-5 | 70-5 | 71-5 | 72-5 | 73-5 | 74:5 | 75:5 | 76-5 | 77-5 | 78-5 | 79-5 | Totals 
(central values) } 
3 3 1 35| 35] 2 2 1 | 21 
2 1 1 1 2 4 6 1 2:5) 1 22 


The highest cephalic index (87-0) is for a female specimen (P 36), in the 
Belgic War Cemetery series, which is also remarkable on account of the fact that 


its facial skeleton is of an unusual form. The highest height-length index is for a 
Biometrika xxx1 
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TABLE III 
Mean measurements of series of Iron Age and Romano-British 
cransa from Maiden Castle 
Male Female 
Belgic War Belgic War 
Cemetery Others Total Cemetery Others Total 

L 187-7 (14) | 189-9(9) | 188-6 (23) | 179-0(7) | 180-5 (19) | 180-1 (26) 

B 141-4 (14) | 140-0 (10)| 140-8 (24) | 135-7 (6) | 135-8 (16) | 135-8 (22) 

Hi’ 137-1 (14) | 136-9 (8) | 137-1 (22) | 134-4(7) | 131-7 (15) | 132-6 (22) 

B’ 97-2 (14) | 96-9(12)} 97-1 (26) | 92:5(7) | 94-1 (19) | 93-7 (26) 

Ss 384-5 (11) | 381-9(7) | 383-5 (18) | 371-3 (6) | 365-6 (10) | 367-7 (16) 

U 528-6 (12) | 530-9 (8) | 529-5 (20) | 504-0 (6) | 508-1 (15) | 507-0 (21) 

fml 37-8 (13) | 37:3(8) | 37-6(21) | 35-9(5) | 35-3(11) | 355 (16) 

fmb 31-6 (12) | 31-2(8) | 31-5 (20) | 28-2(5) | 28-9(10) | 28-6 (15) 

LB 102-5 (13) | 102-2 (6) | 102-4(19) | 96:7(7) | 99-9 (12) | 98-7 (19) 

G@’H 73-3 (8) 71-3 (6) | 72-4(14) | 68-1(6) | 67-2(9) | 67-6 (15) 

NH,L 52-4 (9) 51-2(7) | 51-9(16) | 46-7 (6) | 49-1 (12) | 48-3 (18) 

NB 25-0 (8) 25-1 (8) | 25-0(16) | 22-4(5) | 24:3(8) | 23-5 (13) 

0,L 43-6 (9) 41-2(6) | 42-6(15) | 41-3(5) | 41-3(9) | 41-3 (14) 

On 33-2 (9) 32-3 (6) | 329(15) | 32:5(5) | 31-9(9) | 32-1 (14) 

100 B/L 75-4 (14) 73-5 (9) 74-6 (23) 76-6 (6) 75-5 (16) | 75-8 (22) 

100 H’/L 73-4 (13) | 72-4(8) | 73-0(21) | 75:2(7) | 72-9(15) | 73-6 (22) 

100 B/H’ 103-3 (13) | 102-7 (8) | 103-1 (21) | 101-6 (6) | 102-5 (13) | 102-2 (19) 

100 fmb/fml 83-5 (12) 83-7 (8) 83-6 (20) 79-2 (5) 82-9 (9) 81-5 (14) 

100 NB/NH, L 48-6 (7) 50-2 (6) 49-4 (13) 48-1 (5) 50-0 (8) 49-3 (13) 

100 0,/0,, L 76-3 (9) 78:5 (6) | 77-2(15) | 788(5) | 77:2(9) | 77-8 (14) 

PZ 86°-7 (8) | 86°-0(4) | 86°-5 (12) | 89°-0(3) | 83°-9(9) | 85°-2 (12) 

Total series 
Male Female Male Female Male Female 
BQ’ | 316-7 (20)| 305-3 (23) BY’ | 122-4 (17) | 114-7 (16) Se 10-1 (16)| 9-3 (15) 
S,’ | 114-2 (24)| 110-2 (22)| | Biast. B | 114-3 (19) | 107-2 (14) Oc.1. 58-7 (22)| 60-5 (17) 
S,’ | 116-5 (22)| 112-2 (21) iL 93-4 (14) | 90-3 (11) 100 G’H/GB | 76-3 (13) | 73-5 (11) 
8,’ | 99-9 (22)] 97-0 (17) 95-2 (18)| 89-4 (14)| |100G,/G,’ | 88-7 (12)| 87-4 (7) 

S, | 132-0 (22)| 126-6 (22) J 133-6 (11) | 123-2 (11) 100 SS/SC 47-6 (16) | 47-6 (14) 
S, | 129-9 (21)] 125-3 (21) G,’ 45-6 (19)| 44-0 (12) NZ 61°-0 (14) | 63°-2 (11) 
S, | 122-3 (22)] 116-0 (17) G, 40-7 (16) | 39-0 (11) AZ 76°-3 (14) | 75°-2 (11) 
C* |1514-7 (19)]1380-7 (19) SS 4:7 (16)| 4-3 (14) BZ 42°-6 (14) | 41°-6 (11) 


* Reconstructed from L, B.and H’. 


male cranium (P 7A) in the same series. In spite of these outlying cases, the 
distributions for the two characters in question, and those for the other cha- 
racters, provide no clear evidence of racial heterogeneity, though it cannot be 


inferred from them that the total population considered was racially homo- 
geneous. 
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If the supposition that all the skulls of Iron Age and Romano-British date 
from Maiden Castle represent a single population be accepted, the series available 
is still not long enough to give reliable racial comparisons of a statistical kind. 
A short series is liable to indicate an absence of differentiation from other series 
when one of an adequate length from the same population will provide evidence 
of distinction, and it will also be liable to give a misleading estimate of diver- 
gence from other types when distinction is clearly indicated. Large enough 
samples must be demanded, in particular, when making comparisons between 
closely related populations, such as the group made up by all the prevailing 
populations of England from the end of the Bronze Age to modern times. In 
spite of its restricted size, coefficients of racial likeness were computed for 
male means between the total ‘Iron Age” series from Maiden Castle and four 
others, viz.: 

(a) The Anglo-Saxon made up principally by skulls preserved in London 
Museums (Morant, 1926). 

(b) The so-called British lron Age (Morant, 1926) made up by skulls from 
the south of Scotland and various parts of England. Several of the specimens 
are known to be of Romano-Britons, and some of the others are not dated 
satisfactorily, so the series is of little value. 

(c) The seventeenth-century series from a plague in Whitechapel (Macdonell, 
1904), the revised means given by Hooke (1926) being used. 

(d) The seventeenth-century series from the cemetery in Farringdon Street 
(Hooke, 1926). 

The following coefficients of racial likeness are found for the total “Iron 
Age” series from Maiden Castle and these four, the numbers in brackets following 
the crude values being the numbers of characters on which they are based, and 
the numbers in brackets following the names of the series being the average 
numbers of skulls on which the means used are based (7%’s)*: 


Crude C.R.L. Reduced C.R.L. 


Maiden Castle (17-7) and Anglo-Saxon (34-1) — 0-28 +0-18 (29) — 
A (18-5; and British Iron Age (55-4) 2-98 + 0-21 (20) 10-73 40-77 
(17-7) and Whitechapel (90-7) 2-79 +0-17 (30) 9-43 +0-59 


ae (17-8) and Farringdon Street (96-7) 4-29 +0-17 (30) 14-27 +0-58 


As far as can be told from the scanty evidence, the Maiden Castle and Anglo- 
Saxon samples might represent different sections of the same population, while 
the former series is clearly differentiated from the other three. Previous 
comparisons have shown that the British Iron Age and the two seventeenth- 
century London types are very similar, while the Anglo-Saxon stands apart from 
that cluster. These relationships suggested that the Londoners were descended 
primarily from the pre-Saxon rather than from the Anglo-Saxon population, but 
the new evidence does not support this view. 


* The standard deviations of the Farringdon Street series were used in computing the coefficients. 
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For characters considered singly, there are very few significant differences 
between the means for the short Maiden Castle series and those for the three 
later series. All the differences from the Anglo-Saxon values are quite in- 
significant, the highest of the 29 a’s being 3-8. The only «’s greater than 10 are 
for the nasal angle (NZ, 15-0) and basio-bregmatic height (H’, 11-8) in comparisons 
with the British Iron Age means, for H’ (39-7), 100 H’/L (32-7) and 100 B/H’ 
(28-7) in comparisons with the Farringdon Street means, and for H’ (18-9), 
100 H’/L (15-3), NZ (15-2) and 100 B/H’ (10-5) in comparisons with the White- 
chapel means. The Maiden Castle skulls are markedly orthognathous judging 
from the nasal angle, but this measurement is only available for 14 male skulls 
and little stress can be laid on the peculiarity. Otherwise the type is only 
distinguished by a calvarial height which is large both absolutely and also 
relative to the length and breadth of the brain-box. It is known that the 
Anglo-Saxon is distinguished in precisely the same way from the other types 
considered. 

Table IV gives male means of the three principal diameters and the three 
indices derived from them for the series referred to above and the following: 

(e) Three series of Romano-British skulls described by Buxton (1935), the 
six measurements in question being the only ones available for these. 

(f) An Anglo-Saxon series from Burwell, Cambridgeshire (Brash et al. 1935). 

(g) A third seventeenth-century London series from a burial-pit at Moorfields 
(Macdonell, 1906). : 

(hk) A modern series of Lowland Scottish skulls measured by Turner (1903) 
and compiled in the way described by Hooke (1926, pp. 22 and 38). 

({) A modern series from Glasgow (Young, 1931). 

All the series included in the table are believed to represent populations 
which prevailed in England and the south of Scotland at certain periods from 
the beginning of the Iron Age to modern times, and the minority populations for 
which there is craniological evidence are omitted. The length, breadth and 
cephalic index are seen to be remarkably constant throughout, while the basio- 
bregma‘ic height and the two indices involving this diameter show larger 
differerces. The greatest heights, highest height-length and lowest breadth- 
height indices are for the two series from Maiden Castle, and they thus show a 
remarkable concordance in spite of their very restricted sizes. In these respects 
one of the Romano-British series (the Brigantes) and the two Anglo-Saxon come 
next, and the others follow, with one other Romano-British (the Dobuni) and 
two of the seventeenth-century London series at the lower end of the scale. 
Little significance can be attached to this order, however, as some of the series 
are short. There are no statistically significant differences, for example, between 
the means for the total series from Maiden Castle and those for the Romano- 
British Belgae. The populations represented in Table IV are all remarkably 
similar in type, and they must have been closely inter-related. Larger series 
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TABLE IV 
Mean measurements of British series of male skulls 
Maiden Castle Romano-British 
Tron Age and “ British 
Belgi Iron Age” 
ritish series ic War 
Cemetery Others Total Belgae Dobuni | Brigantes 
L 187-7 (14) | 189-9 (9) | 188-6 (23) | 187-4 (61) | 189-6 (40) | 190-8 (85) | 189-9 (57) 
B 141-4 (14) | 140-0 (10)| 140-8 (24) | 141-4 (102) | 141-0 (41) | 144-2 (77) | 141-7 (57) 
H’ 137-1 (14) | 136-9 (8) | 137-1 (22) | 132-9 (77) | 134-2 (33) | 132-5 (45) | 136-8 (38) 
100 B/L 75-4 (14) | 73-5 (9) 74-6 (23) | 75-4 (61) 74-4 (40) 75-6 (71) 75-7 (51) 
100 H’/L | 73-4 (13) | 72-4(&) | 73-0 (21) | 70-9(61) | 71-0(33) | 69-4 (43) | 71-4 (34) 
100 B/H’ | 103-3 (13) | 102-7 (8) | 103-1 (21) | 106-3 (77) | 105-5 (33) | 109-9 (41) | 104-4 (33) 
| 
Anglo-Saxon { Seventeenth-century London Modern Scottish 
Other - 
London Farringdon White- 
cided Burwell Street chapel Moorfields | Lowland | Glasgow 
L 190-6 (58) | 189-6 (45) | 188-8 (139) | 189-1 (137) | 189-2 (44) | 188-8 (54) | 188-2 (524) 
B 141-7 (103) | 141-7 (45) | 142-4 (141) | 140-7 (135) | 143-0 (46) | 142-1 (54) | 139-1 (524) 
H’ 136-0 (31) | 136-3 (40) | 129-7 (118) | 132-0 (122) | 129-8 (34) | 133-6 (52) | 132-9 (521) 
100 B/L 74-7 (52) 74-8 (45) 75-4 (132) 743(131)| 75-5 (42) | 75-3 (54) | 75-0 (524) 
100 H’/L | 71-2 (25) 71-9 (40) | 68-6 (115) | 70-0 (120) | 68-4(31) | 70-9 (52) | 70-7 (521) 
100 B/H’ | 104-9 (61) | 104-3 (40) | 109-8 (117) | 106-6 (122) | 110-2 (34) | 106-4 (52) | 104-9 (521) 


representing the Iron Age population are obviously required, but the data 
available clearly focus attention on mean differences between the absolute and 
relative magnitudes of the calvarial height. It can be seen from the distribution 
on p. 301 above that all the male skulls from Maiden Castle have height-length 
indices greater than 69-0. In Table IV there are two seventeenth-century London 
series with means for the index less than this value, and one of the Romano- 
British series has a mean of 69-4. These conditions indicate unusual separation 
of the distributions for different series, and there is no reason to suspect that any 
cranial characters other than the height and the height indices would distinguish 
the populations represented in Table IV as effectively. 


4. MEASUREMENTS OF THE MANDIBLES 


Measurements of the mandibles were taken in accordance with the revised 
technique given in Biometrika (Morant et al. 1936, Appendix), and readings for 
individual bones are not provided in the present paper. It has been found 
(Cleaver, 1937) that larger numbers of lower jaws than of crania are needed to 
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give any decisive racial comparisons, and hence the means for all the adult 
specimens of Iron Age and Romano-British date taken together are the only 
ones worth considering. They are given in Table V for totals of thirty-four male 
and twenty-five female bones, and values for Anglo-Saxon (Morant, 1926) and 
a seventeenth-century London (Cleaver, 1937) series are included for com- 
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parative purposes. 


TABLE V 
Mean measurements of English series of mandibles 
Male Female 

Maiden Farringdon Street, Anglo- Maiden /|Farringdon Street, 

re Castle: Londen: seven- S Castle: London: seven- 

a Iron Age | teenth century — Iron Age | teenth century 

| 

uy 123-7 (25) | 121-5 (19) | 117-7+0-53 (23) | 116-6 (22) | 117-5 (12) | 110-8 +0-64 (30) 
Io9o 100-4 (33) | 99-5 (25) | 97-7+0-70 (40) | 92-9 (35) | 90-8(18) | 85-7+0-55 (50) 

Cy 100-3 (27) | 99-3 (20) | 95-9+0-68 (29) | 93-2 (28) | 95-9(18) | 91-7+0-54 (35) 

zz 45-3 (57) | 44-8 (33) | 43-940-24 (40) | 44-1 (50) | 42-5 (24) | 42-940-25 (49) 
21-7 (38) | 21-0 (27) | 19-8+0-20 (34) | 19-1 (35) | 20-0 (14) 18-040-16 (45) 
ml 107-2 (31) | 105-3 (24) | 104-1+0-64 (34) | 104-2 (45) | 98-9 (16) | 99-4+0-61 (43) 
cpl 77-2 (42) | 79-0 (27) | 74-9+0-40 (40) | 74-6 (49) | 70-5 (19) | 69-7+0-37 (50) 
rb’ 33-2 (61) 33-1 (33) | 30-9+0-28-(40) | 31-0 (56) | 30-4 (24) | 28-340-25 (50) 
MsP; 28-1 (59) | 28-4 (23) | 28-2+0-18 (22) | 27-6 (57) | 27-2(19) | 27-7+0-22 (19) 

hy 33-1 (40) | 34-7 (22) | 30-9+0-46 (12) | 30-5 (31) | 30-5 (17) | 29-7+0-36 (26) 
27-2 (51) | 28-2 (20) | 24-94+0-45 (19) | 24-4 (52) | 26-6(15) | 23-6+0-51 (12) 
c,h 65-7 (48) | 70-6 (28) | 64-8+0-47 (40) | 59-2 (47) | 60-2 (25) | 56-5+40-47 (50) 

rl 64-0 (45) | 66-2 (26) | 62-2+0-40 (36) | 59-1 (45) | 57-3(20) | 53-5+0-44 (43) 
MZ 120°-3 (47) |116°-5 (27) |121°-7 +0-60 (40) | 122°-5 (49) | 123°-1 (19) | 127°-8+0-59 (50) 
RZ 72°-0 (36) | 77°-6 (24) | 72°-0+40-94 (37) | 68°-2 (36) | 71°-6 (18) | 71°-2+0-87 (48) 
CL 68°-2 (32) | 69°-8 (18) | 61°-8+1-35 (16) | 70°-O (25) | 70°-3 (14) | 67°-3+0-80 (27) 
100 c,h/ml | 60-9 (27) | 67-9 (22) | 62-4+0-66 (34) | 58-3 (38) | 60-5 (16) | 56-8+0-49 (43) 
100 c,c,/ml | 94-4 (15) | 93-9 (17) | 92-3+41-03 (25) | 91-7 (26) | 97-1 (12) | 92-4+0-74 (30) 
100 go9o/cp| 129-0 (32) | 127-4 (25) | 130-9+ 1-23 (40) | 126-2 (35) | 129-2 (18) | 123-3+ 1-02 (50) 
100 rb’ /rl 51-5 (45) | 50-8 (26) | 50-0+0-54 (36) | 53-0 (43) | 53:5 (19) | 53-2+0-66 (43) 
100 99-3 (19) | 102-1 (18) | 102-94 1-07 (29) | 99-3 (23) | 96-2 (14) | 94-04-0-86 (35) 


None of these is long enough to give an adequate representation of the type 
for the population it represents, and probable errors are only available for the 
last. A few general comparisons are sufficiently suggestive, however, to be of 
interest. For both sexes the two earlier types are very similar in size, and for 
several characters both appear to be significantly larger than that of the 
Farringdon Street series. There is only one measurement of size which provides a 
clear exception to this rule—viz. the length of the dental arcade from second 
molar to first premolar (m,p,)—and it is the only measurement taken relating to 
the size of the teeth. Both Anglo-Saxon and Iron Age types appear to be 
distinguished from that of seventeenth-century Londoners on account of less 
protruding chins (C’2 greater), and the Iron Age appears to be distinguished 
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from the other two on account of a more outstanding coronoid process (judging 
from RZ and 100c,h/ml). Otherwise, there is no clear evidence of distinctions 
between the types. 

Judging from all the characters, the Anglo-Saxon and Maiden Castle popu- 
lations were just distinguished by features of their lower jaws, but the resemblance 
between them was closer than that between either and the later Londoners. The 
relations found favour the hypothesis that the mandibles of Englishmen, but 
not their teeth, became slightly smaller in historical times, but more data will be 
needed to substantiate it. If the hypothesis be accepted, then mandible measure- 
ments should not be used to estimate racial relationships unless allowance is 
made for secular changes in them within the same population. 


5. MEASUREMENTS OF THE LENGTHS OF THE LONG BONES 


Individual readings are given in Table VIII, and means of the adult series for 
the characters of greatest interest are in Table VI. The lengths were taken in the 
ways specified by Miinter (1936), whose means for Anglo-Saxon skeletons are 
quoted. In spite of the small numbers, it was thought worth while computing 
separate means for (a) the Belgic War Cemetery series, and (6) for all the other 
specimens of Iron Age or Roman date from Maiden Castle. 

For both sexes all the absolute measurements for these two series in Table VI 
are less than the corresponding values for Anglo-Saxons. A rough appreciation 
of the significance of the differences between the means can be obtained by 
supposing that the two Maiden Castle populations considered, and also that 
made up by combining them, exhibited the same variabilities as the total 
Anglo-Saxon population. This may appear to be a very arbitrary assumption, 
but in fact it is not unreasonable, since a close approach to equality in variation 
is usually found on comparing different subgroups with a total population, and 
also on comparing distinct populations. 

By applying the Anglo-Saxon standard deviations, we reach the conclusion 
that there are no significant differences between the means of the absolute and 
indicial measurements in the case of the Belgic War Cemetery and the other 
series from the same site, and this is true for both sexes. On the same assump- 
tion with regard to variation, it is found that the pooled Maiden Castle means 
only show markedly significant differences from the Anglo-Saxon in the case of 
the length of the femur for males, the length of the humerus for males and 
females, and the radius-humerus index for males. The differences between the 
statures cannot be supposed clearly significant in the case of either sex, but it is 
safe to conclude from the evidence of both that the Iron Age inhabitants of 
Maiden Castle were shorter than Anglo-Saxons..The sex ratios for stature are 
almost identical, being 1-076 for the former and 1-073 for the latter series, and 
of the order usually found. In inches the estimates of height obtained are 
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TABLE VI 


Means for lengths of the right femur, tibia, humerus and radius, and indices derived 
from these lengths, for Maiden Castle (Iron Age and Romano-British) and 
Anglo-Saxon series of adult skeletons 


Male 

Other than 

Cemetery 
F. max. 437-6 (11) 443-2 (15) 440-8 (26) 463-3 + 1-22 (153) 
T. max.* 362-8 (11) 371-2 (18) 368-0 (29) 378-9 + 1-46 (103) 
H. max. 323-0 (12) 326-8 (17) 325-2 (29) 337-14+1-15 (121) 
R. max. 240-8 (10) 250-9 (15) 246-9 (25) 251-6 + 1-12 (79) 
100 T. obl./F. obl. 81-2 (10) 82-5 (14) 82-0 (24) 81-1+0-17 (92) 
100 H. max./F. obl. 73-9 (11) 74:3 (13) 74-1 (24) 73-5+0-14 (100) 
100 R. max./H. max. 75-4 (10) 77-2 (15) 76-4 (25) 74-6+0-19 (62) 

100 (H. max. +R. max.)/(F. obl. max.)t 71-2 (9) 71-6 (12) 71-5 (21) 70-7 + 0-16 (41) 
Reconstructed stature 1640 (13) 1654 (19) 1649 (32) 1683 (161)t 
Female 
F. max. 411-9 (13) 411-0 (8) 411-5 (21) 426-1 + 2-05 (56) 
T. max.* 336-4 (14) | 342-7 (7) 338-5 (21) | 350-44 1-99 (44) 
H. max. 300-8 (11) 296-4 (8) 298-9 (19) 312-5 + 1-54 (47) 
R. max. 223-0 (13) 215-1 (7) 220-3 (20) 227-6 + 1-41 (34) 
100 T. obl./F. obl. 81-5 (13) 81-2 (6) 81-4 (19) 80-8 + 0-20 (38) 
100 H. max./F. obl. 74:5 (8) 72-3 (7) 73-5 (15) 73-6 + 0-25 (36) 
100 R. max./H. max. 74:2 (9) 73-4 (7) 73-8 (16) 73-5 +. 0-22 (31) 
100 (H. max. +R. max.)/(F. obl.+T. max.)t 70-8 (7) 69-7 (5) 70-3 (12) 70-6 +0-21 (21) 

Reconstructed stature 1535 (17) 1527 (9) 1532 (26) 1568 (59)t 


* Maximum length of the tibia including the spine. 


larger numbers of skeletons. 


+ Maximum length of the tibia excluding the spine. 
t The reconstructed statures for Anglo-Saxons were found from the mean lengths for different long bones, 


and the numbers of individuals given are the numbers of femora involved. The statures actually relate to rather 


5 ft. 6} in. for Anglo-Saxon and 5 ft. 5 in. for Iron Age men, and 5 ft. 1} in. for 
Anglo-Saxon and 5 ft. 0} in. for Iron Age women. The average stature of the 
general male adult population of England to-day is about 5 ft. 7} in., and for 
different social classes means between about 5 ft. 5}in. and 5 ft. 9} in. are 
found. The Iron Age men were thus decidedly short compared with modern 
Englishmen, and the estimated stature for them is very close to the average 
found for all European populations to-day. 

For the measurements considered, the only distinction between the two 
ancient populations other than that in size is found for the index expressing the 
length of the radius as a percentage of the length of the humerus in the case of 
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the male, but not in that of the female, sample. It might be suggested that the 
proportions of the upper and fore-arms of the men who lived at Maiden Castle 
were influenced on the right, or on both, sides by continual practice in the use 
of the sling, which is known to have been one of their principal weapons from 
the evidence of caches of stones. The following means, which relate only to 
paired indices, are of interest in this connexion: 


100 x (H. max.+R. max.)/ 
100 x Radius max./Humerus max. (F. obl. +T. max. ex spine) 


Maiden Castle Anglo-Saxon Maiden Castle Anglo-Saxon 


Male Female Male Female Male Female Male Female 


R. | 76-3 (18) | 73-6 (14) | 74-6 (25) | 74-0 (14) | 71-5 (11) | 70-3 (12) | 70-9 (21) | 70-4 (9) 
a 78-4 (18) | 75-0 (14) | 75-7 (25) 74-6 (14) 70-8 (11) | 68-9 (12) | 70-0 (21) | 69-6 (9) 
R.-L. |—2-1 —1-4 +0-7 +1-4 +0-9 


All the series are very restricted in size, but there is complete agreement in 
the signs of the side and sex differences found. The most significant difference 
between the two populations appears to be that for the male radius-humerus 
index on the left side. This might suggest that the Maiden Castle men were left- 
handed slingers, but their intermembral indices are so close to the Anglo-Saxon 
values that it seems unsafe to accept the hypothesis that the lengths of their 
arms were affected by use. More abundant material might tell definitely for or 
against such a view, and a detailed anatomical examination of the arm bones of 
the Maiden Castle and modern series would also be relevant to the question. 


6. SUMMARY AND CONCLUSIONS 


The material studied consists of the imperfect skeletal remains of eighty- 
three individuals, who are distributed in Table I according to periods, sexes, 
and ages at death. This paper is concerned chiefly with the customary measure- 
ments of the crania and mandibles, and with the lengths of the long bones. 
Comparisons of various kinds are made between: 

A, the Belgic War Cemetery series, representing the defenders of the Castle 
who were massacred by Roman invaders in a.p. 43, and 

B, a composite series made up by all the other specimens of Iron Age and 
Roman date. 

There is no clear distinction between the distributions of the ages at deaths 
of the adults forming the two short series, but for both the men died at a rather 
younger age, on the average, than those interred in English cemeteries of later 
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date, while the females are not distinguished in the same way. Remarks on 
cranial anomalies are provided. The teeth of the inhabitants of Maiden Castle 
were not well preserved. 

Judging from their appearance, nearly all the crania in series A and B would 
be considered quite unexceptional if found in any British collection of later date 
than the Bronze Age, but there are few examples of the markedly retreating 
frontal bone which characterizes seventeenth-century Londoners. There is 
agreement between the male and female samples and this is confirmed by 
measurements of the skulls and long bones. 

There are no statistically significant differences between the mean cranial 
(Table III) and long bone (Table VI) measurements of series A and B. Both are 
characterized by a large calvarial height, and the estimates of stature they give 
are about | in. less than the Anglo-Saxon values. The type of the total series 
(A +B) of crania is found to be indistiriguishable from the Anglo-Saxon, while 
both are differentiated from the types of seventeenth-certury London series. 
This conflicts with the conclusion reached previously, from very inadequate data, 
for Iron Age and Romano-British crania, to the effect that the Iron Age and 
recent types are very similar while the Anglo-Saxon stands apart from both. 
More abundant material representing the Iron Age and Romano-British popula- 
tions will be required to disclose the relationships of these closely allied groups. 
The cephalic index is practically constant for them, and the cranial types are 
distinguished most clearly by differences in the absolute and relative magnitude 
of the calvarial height (Table IV). 

Measurements of the mandibles make a slight distinction between the 
Maiden Castle and Anglo-Saxon series, while both types are larger than that of 
seventeenth-century Londoners. The Iron Age men, but not the women, are 
distinguished from Anglo-Saxons by having a larger radius-humerus index. It 
is not clear that this difference is due to the fact that the men at Maiden Castle 
were slingers. 


APPENDIX 


Tables (VII and VIII) of individual measurements and remarks 


The letters denoting periods (or groups) given in the third columns of the tables are: 
N.=Neolithic, I.A.= Iron Age (A, B or C), S.=Saxon, R.-B. = Romano-British, B. = Belgic, 
B.W.C.= Belgic War Cemetery. The letters denoting cranial measurements are those used 
in all craniometric papers in Biometrika. A list of them is given in the present volume, 
p- 162, but this does not include B’’=maximum frontal breadth (Martin, No. 10), and 
Biast. B.=biasterionic breadth (M. 12). Owing to their fragility, the capacities of the crania 
could not be determined by any direct method. The estimates given in Table VII were 
obtained by applying the reconstruction formulae involving LZ, B and H’ given by Hooke 
(1926, p. 33). The reconstructed statures in Table VIII were obtained by applying the 
formulae given by Pearson (1898), using as many as possible of the bones involved in each 
case. The lengths of the long bones were determined in the ways adopted by Miinter (1936). 
All the readings given in the two tables, whether queried or not, can be considered close ap- 
proximations to the true values, with the possible exception of the few measurements of 
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the Neolithic skeletons enclosed in square brackets. These are more uncertain than the others, 
but they are given because the specimens are of particular interest. 

Unless otherwise stated in the remarks given in Table VII, the cranium and mandible 
are complete, or almost complete, with both dental arcades intact and no teeth lost from 
either jaw before death, and the coronal, sagittal and lambdoid sutures are open. Absence of 
a third molar denotes that the tooth had never erupted, as far as can be told. Two male skulls 
are not in the tables, since no measurements can be given for them, but they were included 
in determining the frequencies of qualitative features. These are: 

P 24 (B.W.C.). Sagittal suture closed, coronal and lambdoid beginning to close. Upper 
jaw incomplete and some teeth lost before death, apparently only | incisor R.; 2 teeth lost 
from mandible and M 3’s absent, abscess cavities at roots of M I’s. 


R2 (?1.A.). Sagittal suture closing. Upper jaw missing. Superficial wound on frontal 
bone. 


LIST OF PLATES 


Plate I. A typical male skull (P 30) from the Belgic War Cemetery. This specimen has a cephalic 
index which is very close to the mean for the series, but its height-length index (70-9) is below 
the average (73-4) and it is peculiarly orthognathous (PZ =90°). The lower incisors are crowded. 


Plate II. Exceptional skulls from the Belgic War Cemetery. 
A. A female adolescent cranium (P 20) with injury to the right malar bone. 
B. A male cranium (P 7A) with a trace of a suture on the right parietal bone. 


C and D. A female skull (P 36) with an exceptional type of facial skeleton and high cephalic 
index (87-0). 


Plate III. Skulls from Maiden Castle, with dental and other anomalies. 
A. A female cranium (T 21) with cyst cavities at the roots of the second premolars. 
B. The broken basi-occipital part of a male cranium (O 3) showing the extension into it of 
sphenoidal air sinuses. 
C. The palate of a male cranium (P 7) with a large anterior palatine foramen. 
D. The palate of a female cranium (T 28) showing absence of the canines and rotation of the 
first premolars. 


E. A female mandible (P 14) with the left third molar impacted and mandibuler torus. 
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1937 N. 3 |[140] | 1005] — | — | — | 118-327] — | 
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03 LA.: B 89-0 -- 118? — 
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Paz 193°5 | 148 | 130°5 | 1162 | 138-5 | 122-1] 1196] 
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O02 3 198 149 95°8 | 130 114°5 | 141 12024 12973 | 103-2 
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T9 ? 1.A. or R.-B. — | 145? | 106-3 | 122? — | 1097} — 
T 16 | 1235] — — | 11633 — 
ez LA.: B 2 176 132 94°0 | 115 100 135 | 96-0 
T 13 182-5 | 141 98-5 — | ror? 135°5 | 107-9] 121-6 | 
T 14 183°5 | 134°5 | 94:0 136? | 1168 
T18 182 135°5 | — | 140 109°5 | 114°8 | 102-7 
T 24 169 127 93°I | 109 103 124 100°4 | 109-0 
Qi 9? 180 135 | 111 110 134 | 104°8 | 993 
Pit Y 1934 2 180 940} — — | 136 113-6 | 96°5 
04 1930 T.A.: C (B.) 185 132°5 | |] 117 10475 | 136 — | 111-6] 96-7 
N2 183°5 | 143 96-2 | 119 134 11472 | 117-1 | 92°07 
T6 1937 174 130 93°0 | III 125 | 106°5 
Ti12 ” 182? | 140°5 | 97-42) — 126? | 108-87), — 
T 21 180 93°5 | 110-5 | — | 124 | 116-0] 104-4] — 
T 22 192? | 138? | orn | — — | 132 | 115:3 | 115-12] 104-07 
T 26 182 135 97°32} — 110 
T 27 ” ” 171 138 94°3? — 
P5 L.A.: C (B.W.C.) 180 131°5 | | 11575 | 112 134 105°7 | 114:2 | 96-0 
P14 188? 96°3 | 119°5 = 139°5?] 111-8 | 118-4 
= 185 137? | 98:3 | 120 115 135 115-6 | 116-2 | 100-2 
P 26 188 136? 87-1 | 116 I10°5 | 139 118-3 | 118-0 97°9 
P31 ” ” 169 | | 108 104°5 | 126°5 | 105°0 | 106-9 | 92-2 
P 36 * % pee 169 | | 147 87-0 | 115? | 106°5 | 133 112-9 | 108-4 | 98-2 
P 37 ” ” 174 135? 94°8 | 118-52] 106% | 134? | r1r-r | 115°6 | 95-2? 
T 28 LA.: C or R.-B 188 95°5 
QR Il R.-B. 184:5?] 136? 88-4 — — | 1305 | 110-0? 
QR il “ 177 143 89-0 | 117? 107°3 
QR IV g 177°5 | 132 89-1 | 112°5 | 107-5 | 128 99°8 
P 20 1.A.: C (B.W.C.) | Adol. | 177°5?} 148-52] 103-4 | 132 — | 130°5?] 112-9 
T 29 2 ~A.: ae 179°5 | 140°0 g2-0 | 118 106 139°0 | 109-0? 


| 

| 

| 


GH J GB 
96°; 
72°8 | 132°5 98+ 
69°9 | 123? 
76°8 | 140 105+ 
89°! 
69-7 | 138 
721} — | 
— — 94° 
750 = 93° 
74°7 | 134 94° 
69°3 | 137°5 | 98: 
— 104" 
67-2 | 1375] 95° 
80-6 | 139 94" 
71-0 | 123 89° 
748 | 133 86° 
62-5 | 132 
83: 
— | 121 80- 
88- 
59°9 | 120 89: 
63-0 | 124 g2- 
84: 
68-57%] 120? 93° 
| 124°5 
692] — | 93 
69°3 | 127 
63°67] 121? 
653) — 86: 
74°0 | 126 97 
68-87} — 92: 
7217) — 
— | 120 
68-07} — 
62-0 | 124°5 


— | 120-42] 98-8? — | 535 | 318 +3 | 102-0 | 89°5 
ai— — igs! — I | 320 | 38°9 | 35°3 5? 
6 | 118 | 383 | 53 *5 | 106-1 | 92-5? 
“1 | 123-7 | 1023 54° | 322 | 3 85: 
. ‘2 | 98-4 | 305] 140 | 117 | 304 96 | 305 | 35°0 | 26:8 | 98-2 
25°59 1195 | 1232 | 360 1 — — 
115°5 | 119-47] 97°38 | 34 2 | 134°52] 407? | 5342] 3202] 33-0? 
142-21] 124-82] 107-22) 28:52] 143? | 1030 | 
2) 102-07) 43? | 133°57] 121? | 398 | 534 | 336 99°1 | 97:6 
8 123°5 | 375 521 | 313 | 39 = 
5 | 1 "3, | 1060 | 90-4 
115°0 | 112-43 96°82 32:5 —_ | 508%) 307 | 36-6? 95°4 
1128 | 115-2 135 | 128-5 | 382 5373| 310? = 106-5 | 96-4 
11369 | 34? | 127%] 123” | 384 | 335) 306 | 356 | 302 | 992] S50 
11607) 113-6? “5 | 120-0 | 127 379 523 | 105-0 | 91-4? 
8-2 | 108-2 | 102-0 | 21-5 559 335%] | 35°9 
| rat | 385. | $23 | 320] 335 | — 
68 _ 10 | 312 | 34°9 | 31-0 | 103-0 
+5 | 114°8 | 102-7 | | 125 121 285 | 33-0 | 29-0 94°4 
1090 | 86°0 | 14°5 | 125°5 | 100 | 44 506 | 298 | 34°8 | 28-1 | 104-2] 99°5 
104°8 | | 24 | 120 — | | 26-8 | 102-3 
967 |— | | 506 | Soe | 
| 37-0 | 285 | | — 
| 92°0% 30° | 130 | 112? | 373°5 | 521? 37°8 | 308 | 97-0] 87-0? 
116-0 | 104-4 | — | 388? | 536%] 305%) — | — 2-8 
| | | sit | 3or | — | 278] | o2 
| — — | — 
— | — | — | 3163) — | — 83-3. 
1184 | — | 315 | 132°5 87 | 523) 38-3 | 33:3 | 98-1 
-6 | 116-2 | 100-2 34°5 3 18 | 310? 39°9 | 29°71 
113-3 | 118-0 | 97°9 117 113 349 477 | 281 | 34:7 4°7 
|p 129 | 108-4 | 08 Tog? 371 | 306 | 38-72) 9 
st | — | — | | | 
1. | ola TH 123 _,| 335°5 | 50! | 292 | 36-2 | 3 92-67] — 
"8 | | | (3°5 26 520?) 40-9 | 31 8 
99 “1? 29 122°5?| 367 70 31-8 | 981 85° 
Gat | ’ 2? | 118% | 374 | 512 | 319 | 34 
109°0?] 118-24) 96°8 | 25? | 13 


TABLE VII 
Measurements of crania from Maiden Castle 


(| | a | | sc|ss| | 1004 100 10%, Oc.1. 
— 258} — | — | 431 38127] — | —] — 67:0? 
8 | 1325] 98:2] | 245 | 40° | 340 | 42-0 | — | 98 | | 1564 | 75-4 | | 103-3 | 646] 90-7 741 
= 50-72 | — | — | — 45:5 | 400 | 12°0 | 5°9 | 1574 | 72-0 72°5 | 6090] 87-3 
9 | 123? | 918 | | 23-8 | 39°5 | 328] — | 83 | 3°8 | 1317] 75°9 97°4 | 59°2 766 761 471 
8] 140 | 105-7 | 53°6 | 25-42] 45°8 | 32°6 | 49:2 | — | 14°5 | 3°6 | 15512] 69-6? | 72-72 95°7? | 582 47-4? 
89°82?) — — — | 43°27] — | — 11500] 73-4? | 72-92 | 100-7? | 56°82] 93-0? 
7 | 138 92-0 | 51-0 | 25°7 | 41-7 | 32°5 | 493 | — | | 1688 | 76-2 79°1 61-12] 87-7 75°8 50-4 
- — 94°67) — 22-27) — — | 493 | 408 | — | — | 1563?) 77:82 | 73°32 | 106-2? 
— — — — — — — 58-2? — 
o| — | 934] 57:7 | 23°9 | 42-9 | 33:7 | 45°5 | 40°02] 14-9 | 60 | 1496 | 77-4 | 766 | rorr | 586] 82-4 80-6 404 
7 | 134 94°3 | 504 | 268 | 44°3 | | 48-0 | 38:7 | 92 | 4:5 | 1504 | 77-4 739 | 1048 | 584] 70°8 79°2 53°2 
3] 1375] 988] 48:8 | 25°09 | 43°7 | 336 | 467 | — | | 3:2 | 1651 | 765 706 106-9 | 56:2 87-7 5371 
—|— — |3¢0| — | — | 1391 | 803 111-7 | 58-6 
— | to¢9] 578 | — | 4081 — | — | GO 764 75°3 | 57°5 83°5 = = 
2 | 1375. | 95°09] 50:7 | | 44°6 | 36°3 | 43°5 | 403] Ot | 4:8 1545 7571 70°9 105-9 | 56°0 86-2 45°6 
| 139 94°3 | 54:2 | 268 | 47°9 | 33-4 | 45°2 | 41-1] | | 15542] 72-02 | 71-5 100-7 | 59°0 79°6 85°5 49°4 
o | 123 89-0 | 47°38 | 23°9 | 40°0 | 30-4 | 46-2 | | 8-9 | 5°6 | 1453 | 69°3 109°3 | 57°3 84:8 79°8 50°0 
“8 | 133 86-9 51-4 22-9 | 42°6 | 34°60 | 47-2 | 43°6 | I0°5 | 6-1 | 1522 75°3 71-2 105°7 62-4 88-6 86-1 44°6 
“5 | 132 gl-2 50°1 28-6 | 43°0 | 31-0 | | 45°0 | 9:8 | Gr | 1534 7371 69°4 105:2 55°90? 79°4 68-5 57°71 
| 469 | 27-2 | 41-3 | 31-0 | 47°71 | 394 | | 25) — — = 58-0 
83-5 | 48-5 | | 41-3 | 31-4 | 40°62) — | 10-0 | 61 | 1347 | 75-0 97°8 | 63:0 752 443 
— | 121 484 | 24°71 | 40°6 | 32-90] — | — | | 4°6 | 1475 | 77:3 742 | 60°5 49°8 
88-4 | 53°0 | 25°5 | 39°2 | 31-8 | 47-2 | 36:9] | 5°7 | 14282) 73-3 | 980? | — — 79°3 48-1 
| 120 89-3} 45:1 | 27°8 | 389 | 30°5 | 45°0 | 38:5 | 8:5 | 3: | 1173 | 75:1 73°4 102-4 | 63-4 87°9 67-1 61-6 
30 | 124 92°3 | 44-4 | 226 | 43°0 | 31-3 | 47°6 | 41-2 | 10-9 | 4°8 | 1391 | 75-0 74°4 100-7 | 59°4 80-7 68-3 50°9 
— | — | 841] 485 | — | 446] 358) — | — | 93] 43] 1486) 779 | 73:0 | 106-7 | 588% 77-0 = -~ 
3-57] 1207 93°5 — | 3057] 35:3 | — | — | 12341 74°7 | 59°6 81-5 73°3? 
— 524 | 23°8 | 43:2 | 32°77 | — | 37-22] 82] 43) — 68-9 45°4 
— 93°07] 46-9 | — | 41-2 | 31°82) 46-4 | 44°00 | — | — | 14792) 71-92 | 68-82 | 1045? | 5927) — 74°42 
x3 | 127 = 40°4 | 92 = = = = = 52°3 
3°67) 121? i — — | — — | — | 1360] 73-1 74°4 981 | 61-0 
| 12775 |] 918 49°5 22-4 | 39°9 | 32°77 | -— — | 10°9 | 4-4 1451?) 74-1? 7370 101-5? | 58-8 86-9 75°2 45°53 
40 | 126 | 50°0 | 23:7 | | 33:7] — | 402] — | — | 15002) 72-32 | 73-9 97°82 | 58:3 729 76-3 47°4 
91-0 | 431 | 24°5 | 39°3 | 20°38 | 48-7 | — | 12-82] — | 1408 | 87-0 78-7 110°5 89-9 74°4 56°8 
8-87, — 92-9 | 45°9 | 21°07] 44-3 | 33°0 | 445 — | 7:7 | 40 | 13512 77°62 | 77-02 | 100-7? | 65-72) 74-9? 741% 45°82 


| 
| 
—|120 — | — — | — | — | — 13978] 73:72 | 70-72] | — 
2-0 | | 890] 47-4 408 | 393 | 68 | 36] 1207 | 74-4 | 721 | 1031 | 505 | 898 69:7 v7 
— | 403} — | — | 14582] 83:72 | 73:52] 113-82 | 61-62) -77°3 = 
= 51-0 = 390 | — | — | 1477 | 780 77°4 100-7 | 58-7 93°5 4-2? 

| 
| 

— 


fb NB Os 88| p No 
~ — — —| — | R5 Incomplete skull-cap only; restored fr 
—| — | Pit3 Upper jaw defective but probably no 
82-3 —_ — 03 Sagittal suture closed, coronal and 
extension of sphenoidal air sinuses. | 
80-2 —| — Sagittal suture partly obliterated, cor 
88-4? — All sutures nearly obliterated. Some 1 
74°1 445 83-7 | 86°52] 59° Tir Sagittal suture closed, coronal beginni 
87:9 49°2 — | 58° 717° 45° | T20 Sagittal and coronal sutures beginning 
76°6 76-1 47°1 83-0 oo 45 — | 58° 78° x T25 Sagittal suture beginning to close. M« 
we — |N1 Sagittal suture beginning to close. 
726? —| — | P2 Sagittal suture nearly obliterated, cor 
95"1 72°7 47°4? 71-2 — 24°8 | 87° 60° 77° | 43° P6 Coronal suture beginning to close. So 
93:0? — — —| — |P7 Sagittal suture closing. Probably no te 
87- 8 9 85° 63° P7A wo upper molars lost before death. 
3 5 85° 67° 70° | 43° P9 Coronal and sagittal sutures beginning 
pat 82:8 — — Pir Posthumously distorted to some exte! 
ane —| — | Pi2 Largely defective. Palate defective v 
cavities 
82- 80-6 I 8-6 87-97 | 87°5 | 50°°5 | 77° | P18 Nasal bones fused 
wound. Upper M 3’s absent 
—| — | P22 suture closing, coronal and la 
19:8 2 2 80-6 8-9 | 86°? | 64° Pi ae” P 23 sagittal suture obliterated, coronal beg 
= = P25 Sagittal and lambdoid sutures oblitera 
87-7 53°1 76-9 | 88°52] 63°%5 | 76°] 40°5 | P27 Sagittal suture closed, coronal and lai 
wound on frontal bone 
— —| — | P28 Sagittal suture closed, coronal and la 
86-0 47°38 P 29 Sagittal and coronal sutures beginnin 
86-2 70°7 45°6 81-4 93°8 52-7 | 90° 59° 81° | 40° P 30 Sagittal and coronal sutures beginnin 
79°6 85:5 49°4 69°7 32°3 | 85° 60° 46°5 | P34 Abscess cavities at roots of upper M 
84:8 79°8 50-0 76-0 87-7 62-9 | 86°-5 | 60° 76° | 44° O1 Coronal and sagittal sutures beginnin 
88-6 86-1 44°6 81-2 92°4 58-1 | go° 8° 78° r O2 Sagittal suture beginning to close. 1 
79°4 68°5 57°1 72-1 62:3. | 81°? | | 78°) 36°5 | QRV Upper M 3, L and both lower M 3’s a 
58-0 7571 83:7 20°7 — —| — |T9 Sagittal suture nearly obliterated. Man 
— —| — |T16 Upper jaw defective and 1 molar lost 
75°2 44°3 76-0 61-0 — — Tr Sagittal suture closed, coronal and la 
49°38 81-0 40°4 T 13 Metopic suture beginning to close, co 
4°1 78-2 62-6 86°? — T 14 Sagittal suture obliterated, coronal c 
88-8 — — |T18 Sagittal suture nearly obliterated, la 
87: 67°1 61-6 8- 6- 80° 69° T2 
3 50°9 ane Bo° 68° Q Malar bone divided by horizontal sut 
81-7 wile — — — | Pit Y No mandible. Probably no teeth los 
83-2 oe —| — | 04 Sagittal and coronal suture closing, 
palate and 2 lost from mandible 
+2? “4? 87°? | 60°5? T6 Sagittal and coronal sutures closed, « 
one = Ti12 side of palate only, from which n« 
2: 85° 69° =| 67° terion in K, R. and L. Lower M 3’ 
74°42 94°8 — T 22 Depressed wound on R. side of front 
aa ak 52°3 piel cm 35°9 87° 63° 76° | 41° T 26 Sagittal suture obliterated, coronal n 
—| — |P5 All molars and 1 incisor lost from u 
at root of f1,R 
aa 758 45°3? 82-6? 80:8 29°3 _ 59°? | 80°? 41°? | P14 Upper M 3’s absent; lower M 3,6 in 
86:9 75:2 45°3 82-0 40°4 89° 56°5 | 80° 43°5 | P19 Coronal suture-beginning to close. 
729 76°3 47°4 78-9 mi —_ 94°? | 55°5 | 79° 45°5 | P26 Two teeth lost from upper jaw befor 
71-2 P 31 Sagittal and coronal suture closing, | 
and most teeth lost 
89°9 74°4 56:8 75°8 84° 66°°5 1° 42°%5 | P 36 Coronal suture beginning to close. 
74°92 74°12 45°83? 74°5 pa 519 — | 63°? | 72°? 45°? | P37 Sagittal and coronal sutures beginn 
posthumous distortion 
— — |T28 Sagittal and coronal sutures closing. 
ome — 86° — — | QRII Sagittal and coronal sutures beginnit 
Large anterior palatine foramen 
— 50°9 — — — | QRIII) Sagittal and coronal sutures beginni 
“8 69 2 96 52-9 | 81 65° 6° 39° QR IV 
= ph, P20 Basal suture obliterated but M 3’s n 
58°°5 » M 3's not fully erupted 


+ 
| 
4 
£ 
| 
_ 
bei 
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g. Cranium restored from a considerable number of fragments and defective owing to mutilation 

only; restored from a considerable number of fragments 

but probably no teeth lost before death 

d, coronal and lambdoid closing. Blunt spine projecting horizontally anterior to opisthion. Basi-occipital broken exposing 
idal air sinuses. 1 upper incisor lost before death 

y obliterated, coronal and lambdoid closed. Several teeth lost from upper jaw before death; mandible defective 

literated. Some teeth lost before death from both jaws 

i, coronal beginning to close. Some teeth lost before death from both jaws; all M 3’s absent 

sutures beginning to close. Fronto-temporal articulation R. and L. Some teeth lost before death from both jaws 

ning to close. Most teeth lost before death from upper jaw but none from mandible 

ning to close. Healed wound on right parietal 

y obliterated, coronal and lambdoid partly obliterated. Upper M 3’s probably lost before death 

~— close. Some teeth lost before death from both jaws 

g. bably no teeth lost before death and lower M 3’s probably absent. Abscess cavity at root of lower PM 2,R. Large anterior 


st before death. Trace of suture (33 mm.) on right parietal 

sutures beginning to ciosse. 6 teeth lost before death from upper jaw 

ted to some extent 

alate defective with several teeth lost before death and 2 abscess cavities; 4 teeth lost before death from mandible and 2 


1, te nearly obliterated, lambdoid closing. Marked posthumous deformation. Depression above left orbit probably due to 
’s absent 

1g, coronal and lambdoid beginning to close. Both jaws incomplete, several] teeth lost before death 

rated, coronal beginning to close: nodistortion apparent. Milk molar retained in place of lower left second premolar which is missing 
d sutures obliterated, coronal closing. Palate incomplete and several teeth lost before death; mandible complete and 4 teeth Jost 
d, coronal and lambdoid closing; occipito-mastoid sutures obliterated. 4 molars lost before death from each jaw. Small healed 
one 

d, coronal and lambdoid beginning to close 
sutures beginning to close. Nearly all teeth lost from both jaws before death 

sutures beginning to close. Upper M 3,R absent, lower incisors crowded 

‘oots of upper M 1,R and M2,R 

sutures beginning tu close. Lower M 3,L absent 

ining to close. 1 upper molar lost before death 

oth Jower M 3’s absent 

; obliterated. Mandible defective but probably no teeth lost before death. Swelling on right maxilla below orbit, possibly dueto tumour 
}and 1 molar lost before death 

d, coronal and lambdoid closing. Several teeth lost from upper jaw before death. Slight wound on frontal bone 

nning to close, coronal and sagittal closing. Most teeth lost from both jaws before death 

erated, coronal closing. 1 upper molar lost before death 

ly obliterated, lambdoid closed, coronal closing. Most teeth lost from both jaws before death 


by horizontal suture R. and L. 

ably no teeth lost from upper jaw before death 

1 suture —— lambdoid beginning to close. Button exostosis on frontal bone. All teeth lost before death from incomplete 
rom mandible 

nning to close. All teeth lost before death except 2 in mandible 

| sutures closed, coronal closing. 2 teeth lost from upper jaw before death; lower M 3’s absent 

ly, from which no teeth had been lost before death 

d L. Lower M 3’s absent. Cyst cavities in buccal surfaces of maxillae: see Plate IIIA 

n R. side of frontal bone above pterion 

erated, coronal nearly obliterated, lambdoid closed. 3 teeth lost from upper jaw before death and 1 from mandible 

vid sutures obliterated, coronal partly obliterated. Palate defective; several teeth lost from mandible before death 

cisor lost from upper jaw before death and disastema between central incisors; 1 molar lost from mandible and abscess cavity 


; lower M 3,£ impacted but eruption nearly complete. Mandibular torus R. and L. Cranium slightly distorted posthumously 
nning to close. Most molars lost before death and abscess cavities in both jaws 

| upper jaw before death; lower M 3,L absent and apparently only 3 incisors erupted 

1 suture closing, lambdoid beginning to close. Palate incompiete and most teeth lost before death; lower dental arcade complete 
it 
nning to close. Several teeth lost from both jaws before death. Facial skeleton of unusual form: see Plate II,C and D 

il sutures beginning to close. Several teeth lost from both jaws before death; abscéss cavity at root of lower M 2,R. Slight 
‘tion 

1 sutures closing. Upper canines apparently never erupted and first premolars rotated: see Plate III D 

| sutures beginning to close. 2 upper and 1 lower molar lost before death; cyst cavity in buccal surface of mandible below M 2 R. 
atine foramen 

| sutures beginning to close. 3 upper molars lost before death 


rated but M 3’s not fully erupted. Lower incisors crowded. Malar bone R. deformed 
opted 


| 
| 
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a 
te 
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| 
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nN 


QR Ill 


LA.: 0 (B.) 


g 
Adol. 
9, Adol. 


Year 
Oe No. | exca- Sex Maximum Oblique Maximum Max. ex spine Obliqat i 
vated 
T4 454 | 458 | | 454 | 367 | 368? | 362 | 363 | 361 
Ti10 » 458 | 462 | 456 | 460 | 381 — | 376 — |374 
Tir 439 | 442 | 437 | 440° 372, | 371 «| 360 368 
T 20 435 | 437 | 432 | 434 | 351? | 350 | 347? | 346 | 343) 
T 25 | 420 | — | 417, | — | — | — | 397 
Ni 1936 454°5 | 457°5 | 452? | 452°5 | | 359°5 | 355°5 | 355°5 | 354°5 
P2 1937 | I.A.: C (B.W.C.) 452, | 452, | 449 «| «| 381 «| 378 379 «| 370 
P6 452 | — | — | — | 37% | 379 | 373 | 376 | 371 
zy 453 | 456 | 450 | 454 | 386 | 386 | 381 | 383 | 380 38 
P7A 451 | 460 | 446 | 457 | 382 — | 379 — |377 
Po ” 429 | 430 | 426 | 426 | 338 — |33 
Pir 453? | 448? | 4497 | 446? | 367 | 367? | 363 | 363? | 362 
Piz — | 384 | 384 | 380 | 380 | 370 
} P18 ” ” ” 495 491 412 414 4ir 406 
| ,, 401 | 405 | 397 | 401 | 329 | 329 | 324 324 | 324 
P22 — | 431 — | 429 | 3567 | — | 353? | — | 351? 
P23 | 458? | 459 | 4532 | 456 | 377 | 378? | 374 375? | 373 
P24 451? | 459 | 4507 | 456 | 375 | 381 «| «| 371 
P25 474? | 477 | 470? | 475 | 398 — | 393 — 
P27 419 | 422 | 417 | 420 | 353 | 357 | 348 | 352 347 
P28 411 | 420 | 407 | 416 | 326? | 330 | 321? | 322 | 3207 
P29 416 | 420 | 414 | 419 | 359 | 357 | 352 | 353° | 351 
P 30 460 | 462 | 457 | 459 | 3908 | 297 | 304 | 304 | 393 
P3 ” 468 | 466 | 466 | 463 | 385 | 389 | 383 | 386 | 380 
eee 1936 R.-B. 3? 445? | 449°5 | 443? | 446 | 364-5?) 366°5 | 361-52) 362 | 360? 
421? | — | 41852] — | 348? | — | 347? | — | 3435? 
1937 444 | 445 | 438 | 439 355 | 362 | 354 | 360 | 352 
?1.A. or R.-B. 444 | 45% | 442 | 446 | 366 | 373? | 363 | 370? | 359 
8. 498? | 4972 | 4962 | 402? | — — 
LA.: B 409 | 408 | 406 | 404 333, | — | 330 | — | 329 
— | 401 — | 396 | 322? | 324 | 310? | 321 | 318? 
373 370 | 322 | 318 | 319 | 315 | 310 
3 391 we 327. | 325 | 322 322 
1936 | » 389 | 3945 | 387 | | 323'5 | 3215 | 321 | 318 | 310 
490°5 | 414°5?| 394°5 | 409? | 330°5 | 337 | 330 | 334°53| 326°5 
1937 414 | 421 | 413° | 418 | 337 | 335° «| 334 «| 332 333 
” ” 4or 397 324 326 320 322 317 
” 432, | 430. | 428 | 427, | 350 «| 348 «| 343 | 346 
432 | 442? | 428 | 434? | 35 | 356 | 347 351 | 346 
406 — | 402 — | 334 335 | 332 | 333 | 33% 
» | LA.: C (B.W.C.) 1 — | 402 | 341 335 | 337 | 329 | 335 
» ” 455 | 457. | 45° | 453 | 373 | 374 «| 368 | 370 «| 367 
373 379? | 370 | 374% | 204 208 | 289 | 203 | 288 
457 | 456 | 454 | 453 | 357 | 358 | 352 | 353 | 350 
445. | 449 | 430 | 440 | 379 | 386 | 376 | 382 | 374 
” 402 | 399 396 | 394 — | 324 
368 =| 372 | 365 | 369 | 316 | 304 | 312 | 301 =| 311 
423. | 428? | 419 | 425? | 330? | 343 | 337? | 339 «| 333 
» | LA: Cor RB. 442 | 444 | 439 | 438 | 361 | 360 | 357 357 | 355 
R.-B. 444 | 440 1441) 439 «| 366 363 361 «=| 358 | 361 
QRIV | ,, 417 | 417 | 414 | 412 | 329 | 330 | 326 | 325 | 324 
T17 ” LA.: B — | 363 | 365 | 359 | 360 | 353 
P 20 » | LA: 438 | 440 1436 | 434 | 347 | 349 | 345 | 346 | 34200 
- 2 wet : | 
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Measurements of long bones o 
Ulnae Clavicles 
100 
Maximum Maximum | Maximum 
L. R. L. R. | L. I 
248? | — [76 
228? | 252°5 — 140 | — 7 
241 — | — 
253 271 274 — | 160 7’ 
258 | 277 | 273? | 137 | 154| 7 
258 — | 273 | 156 | 160 : 
247 266 — | 150 
224 — | 242 —|— 7: 
241-5 | 256? | 257°5 | — | 150 7: 
251 270 271 — | 137 7 
259 | 284 285 | 148 | 152 7 
— | 280 152] — 7: 
265 290 286 — | 1457) 8 
241 260 261 149 | 157 7 
274 138?] — 7 
218 239 241 132 | — 7 
266 |[289?7]} — — | 156 7 
263 285 285 155 | 158 7 
260 — | 293 
243 267 268 
225 245 244 eo 7 
240 261 262 144 | 150 7 
2608 | 284 | 289 | 148] 157| 7 
272 289 293 —|]— 7 
249 266°5?] 269 — | 128 7 
238) | 255°5 | 256°5 | 134 | 143 7 
247 265 142 | — 7 
—= 135 273 159 | 166 7 
226 —_ 245 — | 141 
— | 259 259 130 | — 
215 238 235 130 | 133 7 
214 — | 231 132 | — 7 
211-5 | — | 232°5 | 131 | 137 7 
219 — | 238 | 141] 139) 7 
227 245 249 =. 7 
228 256 249 — | 151 
232 254 253? | 135 | 138%] 7 
212 — — | 141 | 142 
231 258 251? | 136 | 138 y 
I9I 218 214 131 | 132 4 
240 — | 2588 | — — 
186 — | 203 129 | — y 
196 223 221 121 | 122 ; 
221 240 235 
227 246 _ —|— ‘ 
— | | 143 
188 216 212 
— 146 | I51 ‘ 


i 
Max. ex spine Oblique Maximum Maximum Oblique 
L. R. L. L. L. L. 
| — | — | — | — | — | — | — 
| — | — | — | — | — 339? | 319 | — | 3135 
—}|— |] — | — | — | 318 | 283 | 276 | | 273 
362 363 361 361 361 362 | 324? | 32 317 321 
— | — | — | 368] 365 | 350 | 358 | 343 
309 308 | 366 354 — | 336 | 323 | 320 313 
347° 346 «| 343) 342 — | 336 | 315 — | 312 
— | 967 — | — | 339] 314 | 2908 | 310 | 295 
955°5 | 355°5 | 3545 | 355 | 3475 | — | 3205 | — | 3135) — 
376 | | 362 358 | 330 — | 328 
381 | 383 | 380 | 380 — |_— 
— — | 355) 390? | — | 34? | — 
363 | 363? | 362 | 359? — | 3542] 325 | 313 | 319 | 310 
380) | 379 | 375, | 373%) | — | | — 
| 406) | 409) | 405 | 409 | 358 
BS Rees Stes) 324 324 324 323 316 — | 290 292 288 287 
353° | — | 351? — — | — | 324? | 32x? | 317? | 316? 
374 375’ | | 373? | 371 | — | 332 | 328 | 320 | 323 
372 | 377) «| 371) «| | |: 371 | 335 326? | 330 | 323? 
— | 392 — [990 — 325 
348 | 352 | 347. | 351 346 352 | 320: «| 316 =| 309 
321? | 322 | 320? | 321 | 316 | 312 | 304 301 =| 300 ‘| 208 
352) | 353 | 351 | 350 342 | 339 | 312? | 305 | 310% | 304 
394 394: 393 392, | 386 | — | 353. | 349° 348) | 349 
383) | 380) 380 | 382 368) 368%) 356 338) 349 «333 
300°57) 362) | 360? | 361-5 | — | — | 331 | 3165 | 326-5 | 312 
— | 343°5%] — | 336? | — | 307 | 305 | 301 | 2095 
354 300) 352, | 358 | 338 | 351 | 318 | | 316 | 320 | 
363 | 370? | 359 | 368? | 362 | 366 | 323 | 314 315 307 
¥ — | 329 — — | 204 — |292 |4 
318? | 320 320 — | 296 — | 291 — J 
— | 349 — | 347 | 346 | — | 317 |303 | 313 | 300 
319 315 319 314 309 309 | 291 286 288 282 
323 322 322 320 317 | 288 281 286 278 
321 318 319 317 320°5?| 3162] 287-5 | 282-5 | 286 281 
| 326°5 | 331? | 327? | 327 | 300°5 | 208 | 207°5 | 205 
334 33! «| 329° «| 323 | 304 302 209 «| 4 
| 317, | 319 | | | — 
947) 343) 340) 342) «345 — | — | 308 
347 | 35% 346 | 350 | 343 311 | 301 | | 295 
332 333) «332 301 | 296 | 296 | 292 
337) 329) | «328 = 304 | 289 | 302 | 286 
305/370 307 | 367 — 323 | 315 | 313 | 308 
289 293 288 291 269 264 268 263 
3520 (353) | (| 351 321 | 316 312 | 309 
376 382 374 379 _ 318 312 312 304 
— 324 — | 322 | — 
301 | 31r | 300 — 264 | 255 | 261 | 253 
237° | 339) | 333, | 338) | 33t MMM) 310° | 300. | 315 | 208 
357 | 357 | 355 | 353 | 349 — | 315 | 307? | 308 304? 
| | — | — | 
34°) 342 | 344 | | MM) | 310 | 307 | 304 


TABLE VIII 
f long bones of skeletons from Maiden Castle 


Clavicles 
Radius max. Humerus max. Tibia obl. (H. max. + R.-max.) Angle of Claviclo max. 
70 Ffumerus max. | *°° — Femur obl. Femur obl. | obl. +7. max. ex spine) femur 100 Ffumerus max. 
Maximum 
R. | L. R. L. R. L. B. L. R. L. R. | L. R. 
140 | — 78°3 82-6? 71-2? 5° — 49°5 = 
— | 160 77°82 77'1 718? 72:2 80-0 79°5 70°8? 71-1? 6°-8 | 7°-6 48-8 
137 | 154 73°7 80-0 76-1 82-0 3 37°5 440 I 
156 | 160 79°9 769 73°4 84-2 | 83:2 71-9 5°5 | 5°4| 46-4 49°5 
— | 15°] 775 72°9 794 | 788 7182 = I 
— 70°9? 78-4? | 785 69-02 6°02] 8°-5 I 
— | 137 78:5 81-2 69°5 68-5 83°6 67°5 6°-6 | 3°-8 = 443 
148 | 152 78°5 — | — 448 166 
152 75°6 75°6 73°6 84-4 83:7 718 6°6 | 5°%4 | I 
— 81-5 84:5 — 72:0 8°-5 | 6°-6 44°62 166 
149 | 157 82-8 68-3 78-2 6°-8 | 7°8 49°4 54°0 I 
138?] — 78-2 72°42 70°2? 80-6? | 80:5? 71-3? 5°42] 42°52 16 
— | 156 79°8 75°3? 71-9 82-3? | 81-8? 72:2? 71-5? 8°-5 2] 6°6 I 
155 | 158] 782 80-7? 74°42 71°52 | 82-42 | 82-5 72°72 70°72 3°62] 6°6 46-3 485? | 
77°6 76°7 74°5 83-2 | 83:6 72-0 5°60 | 160 
74:0 747 78°62 | 77:2 72:7? 71-3 | 7°9 — 
144 | 150] 77-2? 78°7 75°42 72°8 | 72:22 70-6 4°0 | 46:22 49°2 
148 | 157] 73°7 772 76-0 86.0 | 85-4 720 6°6 45°0 
75°3 76°4 81-5 82:5 73°55 71:8 ° 
— | 128 74:0? 81-3? 81-1 71-6? 70-0 5°42] 40-4 
134] 143] 77°5 78-0 73°4? = 82:12] — 71-2? 6°32] — 43°6 469 
142 | — 76°1 76°5 70°7 713 | 9%4 44-7 
159 | 166 76°2 73°1 70°4 81-2 82-5? 70°7 49°2 52-9 
— | 141 72:8 81-0 — 6°-9 | 8°o0 48-0 
130 | — = 70°2 = — 
130 | 133 742 752 84°8 84-9 72°9 731 8°-3 | 447 46°5 
132 — 74°7 76-2 748 72°4 83-6 | 82:5 69°7 7°11 7°r | 
131 | 137 749 74°3 82-4 80-9 7°°7 69°6 5°83 | 6%5 45°6 48°5 
141 | 139 72°4 73°5 72°92 82-8 80-9? 715 69°5? | 9°32] 469 46°6 
743 75°4 73°6 806 | 787 70-0 70°5 4°0 | 6°8 
73°3 80-8 80-1 70°0 | 6°8 48-6 
73°6 771 72°7 69°4? 80-8 80-6? 67°92? |10°%9?| 43-4 45°82 
74:0 719 81-6 70-2 — 47°4 
728 718 69°5, 81-6 81-0 68-2 66-3 85 | 7°6 43°8 
72°5 723 72°7 70°62 77°8 77°8? 70°4 68-2? 7°3 | 9°32] 48-7 50°0 
76-0 75°9 69°8 77t | 775 70-1 69-0 | 6°-6 
72°3 73°5 70°52? 79°7? 65-0? 6°02] 9°4 50-4 
750 769 72°3 69°1 | 813 68-2 67°3 723] 458 478 
73°7 75°4 70°62 79°5 79°5? 714? 68-2? 7.9.1 — 
73°3 718 70°12 80-9 6 68-6 67-2? | q 
718 714 784 | 7or2 | 46-6 | 


} 
| 
@ 


No. Remarks 
L. R. L. 
Qi Most of the long bones are mutilated 
9° 3 — 
7°6 48°8 T4 
— — — T5 Severe injury to R. elbow joint, and healed fractures of L. radius (near 
wrist) and L. fibula (near ankle), the latter bone being fused to the tibia 
5°°3 37°5 440 T Io R. clavicle markedly shorter and thicker than L. 
49°5 Ti 
om T 25 
8°5 — Ni 
5°4 | 44°7 P7 
6°-6 44°6 Iron arrow-head embedded in twelfth thoracic vertebra 
7°8 49°4 P 9 
6°-6 — 47°6 P 23 R. ulna fractured 
6°-6 46°3 48°5? P24 
5°2 25 Osseous depesits on muscular ridges of L. femur and L. fibula 
5°°6 27 — 
4°°0 46°22 49°2 P29 
| 45°0 P 30 
— P 34 
7°r 40°4 Or 
9°°4| QR V 
49°2 52°9 T 16 
41°3 T 18 
| 46°5 T 24 
45°8 Qi = 
5°°8 | 6%5 04 
| 46°6 N2 
4°0 | 6°8 T6 
81 | — — Ti2 L. humerus markedly deformed, probably by inflammatory condition of 
soft tissues 
10°97? 45°82? 22 
48-0 T 26 
7°0 Ps 
43°8 P14 
9°:3? 50°0 P 16 
6°-6 P19 
P31 
7°°3 478 P 36 Healed fracture of L. fibula 
6°-8? P 37 
= QR II 
9°'5 48°7 P20 Epiphyses of ulnae not attached 
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Goodman and Morant: Human Remains from Maiden Castle 
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A typical male skull (P 30) from the Belgic War Cemetery. | 
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Biometrika, Vol. XXXI, Parts III and IV Plate II am 
Goodman and Morant: Human Remains from Maiden Castle _ 
A, P 20, female. Injury to right malar bone. B. P7A, male. Trace of suture on right parietal bone. ) 
A, ' 
) 
C. P 36, female. Exceptional type of facial D. P 36, female. Profile view of the same skull (C). ° D. 


skeleton and high cephalic index (87-0). 


Exceptional skulls from the Belgic War Cemetery. 


3 
“4 : 
we 
3 
3 
| 
4 
34 
tha 
4 
‘aon 


| , 
) 
Biometrika, Vol. XXXI, Parts III and IV Plate III 
Goodman and Morant: Human Remains from Maiden Castle 
| 
B. O 3, male. Broken basi-occipital showing the 
extension into it of sphenoidal air sinuses. 
) 
A. T 21, female. 
6 
D. T 28, female. 


Absence of canines and first premolars E. P 14, female. Left third molar impacted and 
rotated, 


mandibular torus. 


Skulls from Maiden Castle with dental and other anomalies. 
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HOMOGENEITY OF RESULTS IN TESTING SAMPLES 
FROM POISSON SERIES 


WITH AN APPLICATION TO TESTING CLOVER SEED FOR DODDER 


By J. PRZYBOROWSKI anp H. WILENSKI 
Plant Breeding and Agricultural Experimentation Department, 
University of Krakow, Poland 


1, INTRODUCTORY 
In our previous paper, ‘‘Statistical principles of routine work in testing clover 
seed for dodder”’ (Przyborowski & Wilenski, 1935), we justified the assumption 
that the number of dodder seeds in samples of clover does follow the Poisson 
Law: 


pie) =e™™ (@=0, 1,2...) (1) 


Thus we constructed rules which should be followed in sampling problems 
where that Law holds good. We analysed particularly the application of the 
rules to the routine work in testing clover seed for dodder. 

The purpose of this paper is to present some theoretical considerations 
relating to the problem of homogeneity of results in testing samples drawn 
from Poisson series which we have found arising in the course of our work on 
testing clover seed. 

Let m, and m, denote the parameters of the Poisson distributions of the 
variables x, and x,; the problem to be considered consists in testing the hypo- 
thesis H,(m,=m,=m), that the values x, and x, were obtained in sampling 
from Poisson series, the parameters of which have a common but unspecified 
value, say m. 

The method of testing statistical hypotheses as developed by Neyman & 
Pearson (1933) consists in selecting a rule of rejecting the hypothesis in question, 
whenever the sample point H (the co-ordinates of which in the n-dimensional 
sample space W are the data of observation 2, 2, ...,2,) lies within the 
so-called critical region, say w, of the sample space W. The probability, 
P{Eew | Hy} =a, of rejecting the hypothesis tested, when it is true, is called 
the size of the corresponding critical region w on which the test is based. The 
errors thus committed when rejecting the true hypothesis Ho, are called the 
errors of the first kind. The probability P{Hew | H’} of rejecting the hypothesis 
H, when an'alternative H’ is true has been termed the power of the test with 
regard to H’. P{Hew | H’} considered as a function of H’, where H’ is any 
hypothesis belonging to the class Q of alternatives to Hg, is called the power 
function of the test. The error committed when we fail to reject Hy although 
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an alternative, H’, is true, has been termed an error of the second kind. The 
risk of committing this error is 1 — P{Hew | H’}. 

The general guiding principle to be followed in selecting a test of Hy is to 
choose out of critical regions of size a, a region which makes the power of the 
test with regard to the admissible alternative hypotheses of 2 as large as 
possible. In certain cases there will be a single region, say w), which maximizes 
the function P{Hew | H’} for every admissible alternative. The test based on 
this region, if it exists, is called the uniformly most powerful test of H, with 
regard to the set 2. In very many problems, however, and the present is one 
of them, no such region exists. We shall now consider in our problem what form 
of test is possible, and what are some of its properties. In doing this, it should 
be pointed out that because the random variables zx, and x, can only assume 
integral values, the theory of unbiased tests as developed by Neyman & Pearson 
is not strictly applicable, although the test we shall reach is analogous to the 
unbiased test of type B (Neyman, 1935). 


2. DERIVATION OF TEST 


Suppose, in the first place, that m, does not necessarily equal m, and that 
we write 


(3) 
p=m/(m+m) (4) 


Then the joint elementary probability law of the observations 2,, 2, takes 
the form mi mgs e-m,+m,) 


P(X, Lz | My, My) = 


The sample space is a two-dimensional one as shown in Fig. 1, and since the 
values of x, and x, are necessarily integral, the possible sample points FE are 
distributed discretely as indicated. Along each diagonal line, n = x, +2, = con- 
stant. The form of (6) shows that the probability law is the product of two 
terms: 


(1) The chance p(n | = (7) 


that H# falls on the line (2), which depends only on p; 


(2) The chance that for a given n, the observed partition into x, and 
= Occurs, or 


Pe A) = (8) 


This is a term of the binomial expansion {(1 — p) + p}", depending on p, but not on pv. 
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The hypothesis to be tested, Hy, is that m, =m, or that p= 4; we shall 
suppose that the admissible alternatives to H, are both that m,>m, and 
m, < mz, or that p<} and p>}. Since H, does not specify the value of w (which 
may have any value between 0 and 00), it is a composite hypothesis in the 


A 
164 


Scale of x, 


Ly 


9 10 11 12 13 144 «15 «6 «18 


Fig. 1. Critical region for case «=0-05. It consists of the two areas containing the larger dots. 


sense of Neyman & Pearson. To test it, we should like to be able to use a critical 
region, w, such that 


whatever be 4. No such region can be found, owing to the discontinuous 
character of the distribution of the variables, but it is possible to go a long way 
in determining a satisfactory region on the following lines. 

It will be noted that if p= 4, then the distribution of x, on each line, 
n = constant, is that of the symmetrical binomial ($+ 4)". If p is either >} 
or <}, we shall have a skew binomial with probability density concentrating 
towards the lower or upper end of the line; in order to make the test as efficient 
as possible in detecting departures from p = } in either direction, the critical or 
rejection region should therefore include the two tails of each of the separate 
binomials ($+ 4)". It may be defined as follows. 
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Build up the critical region w of “pieces”, w(n, «), of the lines 


n=2,+2%,=constant (n=0,1,2,...), (10) 
where w(n, «) includes all sample points on (10) for which 


k(n, «) being a positive integer determined so that: 
(a) the sum of the terms of the binomial ($+ 4)" corresponding to 
0,1,..., k(n, a) is <}a; 
(6) the sum of the terms corresponding to 
0,1,..., A(n, a), k(n, a)+1is > $a. 
The pieces w(n, «) and part of the complete region w for the case a = 0-05 are 


indicated in Fig. 1. In view of the form of the probability law (6) it will be 
seen that the region w is such that 


whenever the sample point is included in w, i.e. falls at any of the points in- 
dicated in Fig. 1 by the larger spots, we know that the risk of the first kind of error 
is at most equal to «. Further, while it cannot be claimed that the test is com- 
pletely unbiased or is the uniformly most powerful unbiased test,* it seems likely 
to be as efficient as any other alternative test in detecting departures in p from }. 

In Table I we have given the boundary values k(n, «) of the critical regions 
associated with four values of «, viz. 0-20, 0-10, 0-05, 0-01, and for n = 2,+2, 
varying from 0 to 80. Having decided on the appropriate value for a, the rule 
of the test is: reject the hypothesis that m, = m, if 


X,<k(n,a) or x, >n—Kk(n, a). 
It will be seen that for small values of n no test associated with the « required 
may be possible. 


The power function of the test, which depends on both jw and p, may be 
calculated from the expression 


where > is the sum of the terms of the binomial expansion 
wn, a) 
| +0)", 
satisfying the conditions (11). 
Values of the power for the two cases « = 0-10 and a = 0-05, and for a number 
of values of » and p are given in Table Ila and b. The computation involved 
(a) first finding for a given n the sums of the binomial terms in the second 
summation of (13) with the help of the Tables of the Incomplete Beta Function 


* These terms have been applied to problems of continuous variables. 
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P{Kew | p = 3, <a, 
Cae whatever be yw. Thus if we reject the hypothesis H, which assumes m, = mg, 
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(1934), and (+) then multiplying this by the appropriate term of the Poisson 
series, and (c) finally summing for n = 0, 1, 2, .... 

It will be noted that if the admissible alternatives to m, = m, are only that 
m,>m,, then the appropriate critical region will be defined by x, >n—k(n, a), 
and the risk of the first kind of error will now be <}a. Similarly, if the alter- 
natives are only m, < mg, the region will be defined by x, < k(n, «). These regions 
correspond to the lower and upper marked areas, respectively, of Fig. 1. 

We have also given in Tables II in certain cases the actual value of 
P{Eew | p = 3,4}, that is to say, the chance of rejecting the hypothesis that 
m, = m, when it is true. These chances are, as expected, considerably less than 
the corresponding values of a, but approach a as mw increases and the dis- 
continuity in the distributions of x, and x, becomes of less importance. In 
practice we shall not, of course, know exactly what y is, but as we shall probably 
have a rough idea of the size of the Poisson m in the particular problems 
investigated, the figures ,in these columns of the Tables will probably be helpful 
in determining the value to select for a, the upper limit of the significance level. 
Table ITI contains true significance levels for the other two limiting values, 
a = 0-20 and «=0-01, for which the rejection levels k(n, «) have been given 
in Table I. 

From the data of Tables II it was possible to construct the charts given in 
Figs. 2 and 3 (pp. 322, 323 below), showing in terms of the Poisson parameters 
m, and m, (rather than » and p) some of the contours of equal power. 


3. DISCUSSION OF TABLES AND CHARTS WITH SOME 
ILLUSTRATIVE EXAMPLES 

A point which is clearly brought out by the present investigation is that it 
is impossible to detect differences between the means of two Poisson series 
m, and m,, when both these quantities are small, even if one is several times 
larger than the other. Thus even were m,=0 and m,=5, the chance of 
detecting the difference from the samples would be only 0-384 using a = 0-05, 
and 0-560 using a = 0-10.* As an illustration of the kind of differences that may 
be detected, Fig. 2 may be used as follows. It might with reason be regarded 
as undesirable to plan an experiment in which the chance was less than 0-5 of 
detecting from two random samples that differences between m, and m, of 
practical importance existed. If we pass along the 50% power contour in the 
diagram, we pass through points (m,, m,) of which the following are typical: 

(0-0, 4-7), (2-0, 81), (6-0, 14:3), (10-0, 19-8), (18-0, 30-5). 

if it is important to be able to detect differences of this magnitude, then larger 
samples must be taken in order to increase the expectations. If the samples 

* The true values of the significance level in these cases would be 0-09 and 0-024 respectively ; 


if the test were used with « = 0-20 (and a significance level in this case of 0-065) the chance of 
detection would be somewhat greater. 
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can, for instance, be increased c-fold, the expectations will be increased to 
cm,, Cm,, a point lying on the same straight line passing through the origin as 
M,,™M,, but placed ¢ times further out. Since such radial lines will cut the 
contours of equal power at rather acute angles, considerable multiplication of 
sample sizes may often be needed to ensure detection of differences. 

Regarding a chance of detection of 19 to 1 as satisfactory, we note that the 
95% contour in Fig. 2 passes through points (m,, m,) of which the following 
are typical: 

(0-0, 9-1), (2-0, 15-5), (6-0, 24-0), (10-0, 31-5). 


Differences of this kind, if they exist, can therefore be almost certainly detected 
using in the test the critical values associated with a = 0-10. Rather smaller 
differences could be detected with the same high probability if « were taken 
as 0-20. 

In practice, of course, expected values m, and m, are not known in advance, 
but we believe that reasonings of this kind based on the tables or charts may 
be useful as a preliminary step in planning the extent of sampling required 
either in experimental work or routine analysis. 

Example. In the testing of clover seed for dodder it is customary to withdraw 
a 100 gr. sample from a sack with a long trier or probe. Clearly the number 
of dodder seeds found will vary from sample to sample, and from one sack to 
another in the same consignment. This may be due to the following causes: 

(1) The material from which the samples were drawn was not thoroughly 
mixed. 

(2) The two samples were drawn from different seed stocks. 

(3) There are laboratory errors in analysis. 

(4) The existence of random sampling fluctuations. 

In applying statistical analysis in the comparison of two counts, the 
hypothesis to be tested assumes that the discrepancy to be tested is merely 
due to chance, and that the frequency of dodder seeds, x, will vary from one 
sample to another in accordance with the Poisson law. If a significant difference 
is found, it may of course be due to one of the first three sources of error. 

Suppose that a 100 gr. sample is drawn from a sealed sack of clover and 
sent to a seed testing station, and that no dodder seeds are found in it, i.e. 
x,=0. The buyer, before opening the sack, takes another 100 gr. sample, and 
on having it analysed learns that three seeds have been found, i.e. x, = 3. 
Would he have any justification for criticizing the first testing on the grounds 
that the difference between 0 and 3 could not be due to chance? On examining 
Table I, it is seen that for n =x,+, = 3, even using the least stringent of the 
four tests, the difference cannot be regarded as significant.* 

Suppose that on another occasion it was found that.x,=2 and x, =6. 


* It will be noted from Table III that with » equal to 5 or less, the true significance level for 
the first test with « = 0-20 is at about 0-06. 
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Entering Table I with n =8, we see that for none of the levels of test does 
x, =2 fall in the critical region. If the purchaser still considered that to find 
three times as many dodder seeds in one sample as in the other could not be 
due to chance, we might illustrate the danger of drawing conclusions in this 
way by a use of one of the charts. If one of the expected numbers were three 
times the other, so that m, = 3m,, and therefore p = 0-25, we may ask how large 
/# = m,+m, must be before there is as much as an even chance of detecting the 
difference. Taking Fig. 3, it is found that the line m, = 3m, cuts the 50% power 
contour at about the point m, = 13-8, m, =4-6. Thus samples of clover at least 
twice as large as those taken would be needed to provide only a 50: 50 chance 
of detecting a real difference in dodder content of the order suspected by the 
purchaser. In this case the true significance level of the test used is seen from 
the number on the diagonal line of the chart to be about 0-03. 

To be almost certain of detecting a difference with m, = 3m,, for example 
to make the odds 19 to 1, we notice that a continuation of the same line cuts 
the 95% power contour where m, = 38-4, m, = 12-8. For this samples of about 
600 gr. would be needed. 


4. SUMMARY 


The problem considered is that in which x, and x, are two independent 
random variables distributed in accordance with the Poisson law of equation (1), 
and it is desired to test the hypothesis that the expectations m, and mz, are the 
same. We have shown how a test may be derived which is independent of the 
value of the unknown common hypothetical expectation but which, owing to 
the discontinuous nature of the probability distributions, will only provide an 
upper limit to the significance level, i.e. to the chance of rejecting the hypothesis 
tested when it is true. The manner of approach of the significance level to its 
upper limit has, however, been investigated numerically. 

A table (Table I) has been provided, containing critical values k(n, «) 
required in carrying out the test. The power function of the test has also been 
determined, and tables (Table IT a, 6) and charts (Figs. 2, 3) given which make 
it possible to determine the chance of detecting differences in the expectations 
m, and m, of specified magnitudes. Finally, a discussion of some uses of the 
test has been added. 


In conclusion we should like to express our thanks to Prof. E. S. Pearson 
for his helpful suggestions and criticism made during the course of our in- 
vestigation.* 


* [Certain modifications and additions to the paper have been made since it was received for 
publication at the beginning of July 1939. As circumstances have unfortunately made com- 
munication with the authors impossible, I must accept responsibility for these alterations, which 
have been mainly concerned with the extension of Tables I and II to higher values of n and p. 
E. S. P.] 
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TABLE I 
Boundary values, k(n, a), for each n 


Upper limit of significance level, « Upper limit of significance level, « 
n n 
=X, +2, =2, 
0-20 0-10 0-05 0-01 0-20 0-10 0-05 0-01 
1 41 15 14 13 il 
2 42 16 15 14 12 
3 : 43 16 15 14 12 
4 0 44 17 16 15 13 
5 0 0 45 17 16 15 13 
6 0 0 0 46 18 16 15 13 
7 1 0 0 47 18 17 16 14 
8 1 1 0 0 48 19 17 16 14 
9 2 1 1 i) 49 19 18 17 15 
10 2 1 1 0 50 19 18 17 15 
11 2. 2 1 0 51 20 19 18 15 
12 3 2 2 1 52 20 19 18 16 
13 3 3 2 1 53 21 20 18 16 
14 4 3 2 1 21 20 19 17 
15 4 3 3 2° 55 22 20 19 17 
16 4 4 3 2 56 22 21 20 17 
17 5 4 4 2 57 23 21 20 18 
18 5 5 4 3 58 23 22 21 18 
19 6 5 + 3 59 24 22 21 19 
20 6 5 5 3 60 24 23 21 19 
21 7 6 5 4 61 24 23 22 20 
22 7 6 5 4 62 25 24 22 20 
23 | 7 6 4 63 25 24 23 20 
24 8 7 6 5 64 26 24 23 21 
25 8 7 7 5 65 26 25 24 21 
26 9 8 7 6 66 27 25 24 22 
27 9 8 7 6 67 27 26 25 22 
28 10 9 8 6 68 28 26 25 22 
29 10 9 8 7 69 28 27 25 23 
30 10 10 9 a 70 29 27 26 23 
31 ll 10 9 7 71 29 28 26 24 
32 11 10 9 8 72 30 28 27 24 
33 12 11 10 8 73 30 28 27 25 
34 12 ll 10 9 74 30 29 28 25 
35 13 12 ll 9 75 31 29 28 25 
36 13 12 11 9 76 31 30 28 26 
37 14 13 12 10 77 32 30 29 26 
38 14 13 12 10 78 32 31 29 27 
39 15 13 12 ll 79 33 31 30 27 
40 15 14 13 ll 80 33 32 30 28 


Rule of test: reject hypothesis that m, = m, if x,<k(n, a) or x, >n—K(n, a). 
See note on p. 323 regarding extension of limits for n >80. 
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Power of test; upper limit of significance level, a = 0-10 


TABLE Ila 


p=0-0 
(hypothesis p=0-4 p=03 p=0-2 p=0-1 (m,=0, 
=m, + true) 
2 —- 0-004 0-008 0-016 0-030 0-053 
3 — 0-013 0-028 0-057 0-106 0-185 
4 0-026 0-056 0-116 0:217 0-371 
5 0-024 0-039 0-087 0-182 0-338 0-560 
6 _- 0-050 0-117 0-248 0-453 0-715 
7 — 0-061 0-147 0-311 0-553 0-827 
8 oe 0-071 0-176 0-371 0-638 0-900 
9 —_ 0-081 0-205 0-424 0-709 0-945 
10 0-047 0-093 0-235 0-482 0-768 0-971 
ll = 0-100 0-264 0-533 0-817 0-985 
12 - 0-110 0-292 0-580 0-856 0-992 
13 -— 0-119 0-320 0-624 0-888 0-996 
14 — 0-128 0-347 0-664 0-913 0-999 
15 0-059 0-136 0-373 0-700 0-933 0-999 
20 0-065 0-172 0-482 0-831 0-983 1-000 
25 0-068 0-205 0-579 0-906 0-996 oo 
30 0-071 0-239 0-661 0-949 0-999 — 
35 0-073 0-272 0-732 0-973 1-000 — 
40 0-074 0-301 0-784 0-986 — — 
45 0-074 0-330 0-829 0-993 — — 
50 0-075 | 0-359 0-865 0-996 — — 
TABLE II 6 
Power of test ; upper limit of significance level, a = 0-05 
p= 0-5 | p= 0-0 
(hypothesis p=0-4 p=03 p=02 p=0-1 (m,=90, 
=m, + mM, true) | 
| 
2 — 0-001 0-002 | 0-004 0-009 0-017 
3 — | 0-004 0-009 0-021 0-044 0-084 
4 a 0-009 0-023 0-054 0-113 0-215 
5 0-009 0-016 0-042 0-099 0-208 0-384 
6 — 0-023 | 0-063 0-152 0:313 0-554 
7 0-031 0-085 0-206 0-416 0-699 
8 = 0-037 0-107 0-260 0-510 0-809 
9 — | 0043 | 6129 | 0312 0-593 0-884 
10 0-022 | 0049 | 0-361 0-664 0-933 
0054 | 0-408 0-724 0-962 
12 060 0-192 0-453 | 0-774 0-980 
13 — 0-065 | 0-213 0-496 0-817 0-989 
| 14 0-070 | 0-235 0-537 0-852 0-994 
| 15 0-027 | 0076 | 0256 0-576 0-881 0-997 
= | 0-029 | 0-099 | 0353 0-731 0-962 1-000 
25 0-030 | 0-120 0-440 0-833 0-989 — 
30 0-031 0143 0524 |- 0-900 0-997 
35 0-033 | 0-170 0-604 =| 80943 0-999 
40 0-034 o198 | 0675 | 0969 | 1-000 ion 
45 | 6224 | 0-735 | 0-983 
00387 || 0-250 0-991 | 
| 
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Upper limit of significance level Upper limit of significance level 
0- 0-01 
True level Bb True level 
0-065 5 0-0007 
0-109 10 0-0031 
0-123 15 0-0042 
0-132 20 0-0048 
0-137 25 0-0054 
0-141 30 0-0058 
0-148 35 0-0062 
0-152 40 0-0065 
0-154 45 0-0067 
0-155 50 0-0068 


N.B. If hypothesis tested is true, «=2m,=2m,. 
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Note regarding Table I. When x, and zx, are large, if the hypothesis tested is true then 
(x, —2g)/4/(x,+2_) may be regarded as a unit normal deviate or, from another point of view, x, 
should be normally distributed about 4n with standard deviation $4/n. On this basis, in the 
case of n=80, the values of k(n, «) for «=0-20, 0-10, 0-05 and 0-01 are found to be 34-3, 32-6, 
31-2 and 28-5, respectively. Applying a correction for discontinuity we have 33, 32, 30 and 28 for 
the integral values of k(n, «) associated with upper limits of significance level equal to the four 
values of «. These are the values for k(n, «) given in the last row of Table I. It is clear that 
beyond x =80 the normal approximation is valid. 
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ON THE METHOD OF PATRED COMPARISONS 
By M. G. KENDALL anv B. BABINGTON SMITH 


INTRODUCTION 


1. Suppose we have a number of objects A, B, C, etc. which are to be 
considered according to the different degrees in which they exhibit some 
common quality. If the quality is measurable in some objective way the 
objects will yield a number of variate values, in which case the problem is 
amenable to treatment by well-understood methods. It may, however, happen 
either for theoretical or for practical reasons that the quality is not measurable. 
We then have to rely for a discussion of the variation of the quality on judgments 
of a more or less subjective kind carried out after a comparison of the objects 
among themselves. 

One of the methods of comparison which has been widely used in this 
connexion is that of ranking. An observer examines the objects and arranges 
them in the order in which he judges them to possess the quality under con- 
sideration. This arrangement is‘called a ranking, and when two or more observers 
provide rankings of the same set of objects there arise the familiar questions of 
the type: is there any significant resemblance between the judgments of 
observers? or, do the data furnish any evidence that the objects have a “real” 
objective ranking? 

2. The ranking method suffers from a serious drawback when the quality 
considered is not known with certainty to be representable by a linear variable. 
We may, for instance, ask an observer to rank a number of individuals in order 
of intelligence, and he may comply with the request in the full belief that he 
is doing something within his powers; but if intelligence is not measurable on 
a linear scale this ranking may fail to give a real picture either of the observer’s 
preference or of the variation of intelligence among the individuals. It is not 
impossible that the observer should judge A more intelligent than B, B than C, 
and C than A, if the individuals are presented for his consideration one pair at 
a time. The likelihood of this happening is obviously increased when we are 
dealing with tastes in music, eatables or film stars; and in practice the event 
is not uncommon. Such “inconsistent” preferences can never appear in ranking, 
for if A is preferred to B and B to C, then A must automatically be shown as 
preferred to C. The use of ranking thus destroys what may be valuable infor- 
mation about preferences. 


3. In this paper we consider a more general method of investigating pre- 


ferences. With n objects, we shall suppose that each of the (3) possible pairs 


is presented to an observer and his preference of one member of the pair noted. 
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We assume that a choice between two objects can always be made.* With 


m observers the data then comprise m(%) preferences. The questions to be 
discussed include: 


(a) Is there any evidence that a particular observer is capable of forming 
a reliable judgment of the quality under investigation; and if not, is the fault 
his, or is it due to the fact that he has been asked to perform an impossible task ? 

(b) Is there any significant concordance of preferences between observers? 

(c) Can the quality under discussion be represented by a linear variable? 

4, The method of offering for judgment objects two at a time is known as 
the method of paired comparisons. Hitherto it has been used mainly in human 
psychology, but it has some interesting applications in animal experimentation. 
For instance, in feeding experiments it is impossible to get an animal to rank 
a number of foods in order of preference but it is not difficult to offer pairs of 
foods and to note which is taken first. Experiments of this kind have, of course, 
to be conducted with great care to ensure that conditions operating when the 


different pairs are offered are as constant as possible; but the difficulties are . 


far from being insuperable and the method of paired comparisons offers a useful 
technique in cases where the more usual procedures cannot be applied. From 
the point of view of theoretical statistics perhaps the most interesting part of 
the present work is that it offers some lines of approach to the difficult question 
whether a given quality can be legitimately regarded as based on a linear 
variable, i.e. whether ranking or scoring methods are justifiable or not. 


CONSISTENCE IN PREFERENCES 


5. If the object A is preferred to B we write A>B or B<A. The (3) 


preferences of a single observer may be represented in tabular form as shown 
in Table I. 

In this table, which is shown for the six objects A to F, an entry of unity 
in column Y and row X means X-> Y, and is thus accompanied by a comple- 
mentary zero in row Y and column X. The diagonals are blocked out. For 
example, in Table 1, A>B, A+>C, D > A, ete. 

The arrangement of the objects A to F in the row and column headings is 
quite arbitrary. There are (n!)? ways of representing the same configuration of 
preferences in such a table according to the permutations of objects in row and 


* That is, we exclude cases in which an observer cannot make up his mind which object he 
prefers, just as in the ranking case one excludes the possibility of split ranks. In practice it some- 
times happens that an observer is genuinely unable to reach a decision. To allow for this fact in 
the theoretical discussions would introduce complications of a most intractable kind. When the 


effect becomes important in practice it can be allowed for by selecting the set of preferences which 
are most unfavourable to the hypothesis under test. 
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TABLE I 

A B E F 
A l 0 1 
B 0 — 0 1 1 0 
0 l 1 1 
D 1 0 0 — 0 0 
E 0 1 1 
F 0 1 0 1 0 — 


column; but in practice it is generally desirable to have the order in row and 
column the same, and even among the n! possible arrangements so given there 
are often practical considerations which determine one order as more convenient 
than others. 

6. Paired comparisons may also’be represented geometrically by a method 
which can be illustrated for the case of the six objects as follows: 


A B 


>. 
> 


E D 


Fig. 1. Geometrical representations of the scheme of preferences of Table I. 


We represent the six objects A to F by the six vertices of a regular hexagon 
and join the vertices in all possible ways by straight lines. If A+B we draw 
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an arrow on the line AB pointing from A to B. The arrows shown on Fig. 1 
correspond to the preferences shown in Table I.* 

7. If an observer makes preferences of type A>B—+C-—A we say that the 
triad ABC is inconsistent. In the geometrical representation an inconsistent 
triad is shown by a triangle in which all the arrows go round in the same 
direction. We may thus speak of a “circular” triad of preferences. In Fig. 1 
the triads ACD, BEF and three others are circular. 

It is also possible to have inconsistent triads of greater extent; but any 
such circuit must contain at least two circular triads. Suppose, for instance, 
that ABCD is circular, e.g. that A> B+C—+D-A. Then either A>C or C>A. 
In the first case ACD is circular, in the second ABC. Similarly either ABD or 
BCD is circular. Thus the circular tetrad must contain just two circular triads. 
On the other hand it is possible for a tetrad to contain circular triads without 
being itself circular. 

Similarly, if ABCDE is circular either ABC or ACDE is circular and either 
BCD or BDEA is circular. If the two tetrads are circular there must be at least 
three circular triads (not necessarily four, because ADE may be common to 
both). It is easy to see by an actual example based on this configuration that 
there need not be more than three circular triads; and it is clear that there must 
be at least three. For if the tetrads are not circular then ABC and BCD must 
be so and then either CDE is circular or ABCE is so, adding at least one more. 

Generally, it appears that a circular n-ad must contain at least (n— 2) 
circular triads; but it may contain more, and the fact that an n-ad contains 
(n—2) circular triads does not mean that it is itself circular. In discussing 
inconsistences, therefore, it seems best to confine attention to circular triads, 
which, so to speak, constitute the inconsistent elements of the configuration, 
and to ignore the more ambiguous criteria associated with circular polyads of 
greater extent. 

8. We now prove the following theorems: 

(1) The maximum possible number of circular triads is (n’—n)/24 if n is 
odd and (n*— 4n)/24 if n is even; and the minimum number is zero. 

(2) These limits can always be attained by some configuration of pre- 
ferences. 

(3) For any integral number between the maximum and the minimum 
there exists at least one preference-configuration with that number of circular 
triads; and in general there wi!l be more than one. 

Consider a polygon of the type shown in Fig. 1 with n vertices. There will 


* These preferences were obtained in an experiment on a dog, which was offered the following 
foods in pairs: meat, biscuit, chocolate, apple, pear and cheese. The members of a pair were cut 
to the same size and placed equidistantly from the dog, which was then released and allowed to 
choose. All the pieces of food were eaten avidly, it being that sort of dog, but there were considerable 


inconsistences in choice. We do not offer these data as more than an illustration of the 
method. 
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be (n—1) lines emanating from each vertex. Let a, a, ..., a, be the number 
of lines at the respective vertices on which the arrows leave the vertex. 


Then 8 = (5) 
r=1 2 
and the mean value of a, is (n —1)/2. 
—}\2 
Define T = 8 
r=] 
n(n —1)? 


We now show that if the direction of a preference is altered and the effect 
is to increase the number of circular triads by d, T' is reduced by 2d; and 
conversely. Consider the preference AB. The only triads affected by altering 
this to B> A are those containing the line AB. Suppose there are « preferences 
of type AX (including A+B) and # preferences of type B->X. Then four 
possible types of triad arise: 


A->X<-B, say p in number 


A<X-—B, 
A-+>X-—>B, which must number a—p-1 
A<X<B, B-p. 


When the preference A->B is reversed the first two remain non-circular. 
The third becomes circular, the fourth ceases to be so. The reduction in the 
value of 7 is a? (a —1)2+ +1)? 

= X(a—f-1) 
= 2d, say. 
The increase in the number of circular triads is 
(a—p-1)-(f-p) =a-f-1 
=. 

More generally, if as the result of reversing any number of preferences 7' is 
decreased by 2d, then d must be an integer and the number of circular triads 
must be increased by d. This clearly follows from the previous results for the 


reversal of preferences can take place one at a time and the effect on 7 and the 
number of circular triads is cumulative. 


We now investigate the maximum and minimum values of 7’. It is clear 
from the definition that 7' is greatest when the @’s are the natural numbers 


1, 2,..., n; and this is a possible case because it corresponds to ordinary ranking. 
Hence max. (7') = (n?—n)/12. 
For the minimum value, consider the polygon A,, Ag, ..., A,. Set up the 


preferences A, A,—>...A,—>A,. Clearly at any vertex this results in one arrow 
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entering and one leaving the vertex, i.e. the contribution to @ is unity at each 
vertex. Next set up the preferences A, A,>A,;—.... This circuit may either 
visit each vertex once, or not. In the latter case we proceed to an unvisited 
vertex and set up the preferences A,>A,,,>A,,,—>-.. and so on. Again there 
will be a unit contribution to all the @’s. 

We then set up the preferences A, A, A,-—> etc. and so on; and in this 
way we shall ultimately complete the preference scheme. 

If n is odd all the preferences described will consist of circular tours of the 
polygon, and thus the value of « for each vertex will be (n—1)/2. If n is even 
the last preference A,—A,,,, will not be a tour but will consist of the single 
line joining one vertex with the symmetrically opposite vertex. Thus there will 
be n/2 vertices for which a = n/2 and n/2 vertices for which a = (n—2)/2. In 
this case T' = n/4. 

Now it is clear from the definition of 7' that it cannot be less than zero, or 
if m is even, be less than n/4. The configuration just given shows that these 
minima are, in fact, attainable. 

Thus 7 can vary from a maximum of (n*—n)/12 to a minimum of zero 
or n/4. Hence the maximum number of circular triads, being half the variation 
from maximum to minimum of 7' (the maximum of 7' corresponding to the 
ranking case in which there are no inconsistences) is (n*—4n)/24 if n is even 
and (n3—n)/24 if is odd. 

This establishes the first two results enunciated at the beginning of this 
section. To prove the third it is sufficient to give a systematic method of 
proceeding from the configuration. of minimum to that of maximum incon- 
sistence by steps decreasing 7' two at a time. Consider, for example, the case 
n = 8. For the minimum inconsistence the ’s will have the values 0 to 7, which 
we set out thus: 


A B C D E F G H 
0 1 2 3 + 5 6 7 
We proceed by reversing the preferences between vertices whose a-values 
differ by two. This clearly reduces T' by two. 
Reversing the preferences between C and E we get 
A B Cc D E F G H 
0 1 3 3 3 5 6 
and between D and F we get 


A B C D E F G H 
0 1 3 4 


w 


which we may rearrange as 


A B C E D’ F G H 
0 1 3 3 


for) 
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Now reversing the preferences between B and E and between D and G and 
rearranging we have 
A B E C F D G H 
0 2 2 3 4 5 5 Ss 
and now interchanging A and B, G and H, 


A B E C F D G H 
1 1 2 3 4 6 6 


At this stage we have preserved the a-numbers 2, 3, 4 and 5 in the middle 
but reduced the extremes A and H. We can now carry out the process again, 
arriving at the a-numbers. 


1 2 2 3 + 5 5 6 
and twice again, giving ‘ 
2 2 3 3 4 4 5 5 


whence a final interchange gives 
3 3 3 3 + 4 + 4 


and this is the position of maximum inconsistence. It is readily verified by 
following the interchanges on a polygon diagram that the reversals are, in 
fact, legitimate. 


COEFFICIENT OF CONSISTENCE IN PAIRED COMPARISONS 


9. If d is the number of circular triads in an observed configuration of 


preferences we define 24d 
odd | 
=l1- ™ even 


and call ¢ the coefficient of consistence. If and only if it is unity there are no 
inconsistences in the configuration, which may therefore be represented by a 
ranking. As ¢ decreases to zero the inconsistence, as measured by the number 
of circular triads, increases. 

For example, in the configuration of Fig. 1 there are five circular triads, 
ABD, ACD, AFD, AED and BEF. The maximum possible number is 8. 
Thus € = 0-375. 

10. ¢ can also be interpreted in the light of Table I. Suppose, in that table, 
we sum the rows. (The column sums are determined by the row sums and 
add no fresh information.) The sum of any row will be the a-number for that 
vertex in the polygon which corresponds to the object defining the row. T will 
then be the value of the sum of squares of deviations of row totals from the 
mean value (n — 1)/2, that is to say, will be the variance of the row sums multiplied 
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by n. ¢ is thus a linear function of this variance; but it cannot be tested in the 
x? distribution as if Table I were a contingency table, for the border cells are 
not independent or linearly dependent. 

11. If an individual observer produces a configuration of preferences which 
show inconsistence there are usually several explanations; he may be an 
incompetent judge, the objects may be so alike that consistent differentiation 
is not possible, or his attention may wander during the course of the experiment. 
We discuss these questions later. They are mentioned here to explain the motive 
for the next stage of the mathematics. With what probability can a value of 
€ arise by chance if the observer allots his preferences at random with respect 
to the quality under consideration ? 


With » objects there are 2 2) possible configurations of preferences. We 


n 
proceed to investigate the distribution of d in this universe of 2 2) different 
members. The method consists of proceeding from the distribution for n to 
that for (n+1). 

For x = 3 there are eight configurations, of which two give one circular 
triad and six no circular triads. Consider the effect of adding a new vertex D 
to the vertices ABC. Four cases arise: 


(1) all A, B, C: 

(2) D+ two of A, B, C. 
(3) D> one of A, B, C. 
(4) D- none of A, B, C. 


The last two are symmetrical with the first two and need not be separately 
considered. 

Situation (1) arises in one way and clearly does not add any new circular 
triads other than those already existing in'the configuration ABC. It therefore 
contributes six values d = 0 and two values d= 1. So does situation (4). 

Situation (2) arises in three ways, according as D<A, B, or C. The con- 
figurations so reached are similar and we may take any one, say D<C, as the 
single preference. If A<-C' then DAC is not circular and if B<-C the DBC is 
not circular. On the other hand A+>C and BC will each produce a circular 
triad. We then have the cases 


No. of circular 
triplets added 


A<C—B 0 
A-C—-B l 
A<C<B 1 
A-C<+B 2 
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We now consider AB. In the first two cases just enumerated the direction 
of AB does not matter and no circular triads are added. With the third A>B 
gives no circular triad but A<B adds one. With the fourth A+B adds one 
and A<-B adds none. 


Thus the number of circular triads occurring for these four cases is found 
to be 


No. of circular a 
triplets 

0 2 

1 

2 4 


We must multiply the frequency by three and by two to allow for similar 
symmetrical arrangements, and the final results are 


No. of circular 
triplets Frequency 
0 24 
1 16 
2 24 
Total 64 


The principles of this method are clear enough and the work may be 
formalized by a number of conventions which we omit to save space. In common 
with many similar combinatorial problems, however, troubles arise from the 
sheer number of possibilities and the difficulty of ensuring that nothing is 
overlooked. Up to the present we have found the distribution of d for n up 
to and including 7. The frequencies and probabilities are given in Table IT. 


12. For the values already obtained the moments are given by the following 
formulae : 


1 
(about 0) = i(s)> (3) 
3 
3 


My = )+29("3 )+9("7 ) +7}: (6) 


We have very little doubt that these results are true in general but can 
offer no rigorous proof. In so far, however, ac the moments are in a sense 
symmetric sums it appears highly probable that they are given by polynomials 
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TABLE II 


Frequency (f) of values of d and probability (P) that values 
will be attained or exceeded 


n=3 n=4 n=5 n=6 a=7 
Value 
of d 
f P f P f P f P a P f P 
0 2 1-000 6 1-000 | 24 1-000 | 120 | 1-000 720 | 1-000 5,040 | 1-000 
1 2 0-250 16 625 | 120 -883 960 -978 8,400 -998 
2 24 +375 | 240 -766 | 2,240 -949 21,840 -994 
3 240 531 | 2,880 -880 33,600 -983 
+ 280 297 | 6,240 792 75,600 | -967 
5 24 023 | 3,648 602 90,384 -931 
6 8,640 491 179,760 +888 
7 4,800 227 188,160 +802 
8 2,640 081 277,200 -713 
9 280,560 +580 
10 384,048 447 
11 244,160 -263 
12 233,520 | 
13 72,240 -036 
14 2,640 | -001 
Total| 2 8 = 64 —= — |2,097,152 — 


in n; and if this is so the values obtained are sufficient to establish polynomials 
of degree six or less. 


It is also to be noted that from the above values of the moments 
B= = ~3 + 12/n, 
from which it appears that a Type III distribution would fit the d-distribution 
fairly closely for moderate or large values of n. But as the distribution of d is 


of interest mainly for low values of n, which are all that occur in practice, it 
hardly seems worth while attempting to fit a curve. 


AGREEMENT AMONG SEVERAL OBSERVERS 


13. We now consider the investigation of similarities of judgments for 
m observers. Suppose that in a table of the form of Table I we enter a unit in 
the cell in row X and column Y whenever X-> Y and count the units in each 
cell. A cell may then contain any number from 0 to m. If the observers are in 


complete agreement there will be (5) cells containing the number m, the 


remaining (5) cells being zero. The agreement may be complete even if there are 


inconsistences present. 


Biometrika xxx1 22 
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Suppose that the cell in row X and column Y contains the number y. Let 


the summation extending over the n(n —1) cells of the table (the diagonal cells 
being ignored). & is then the sum of the number of agreements between pairs 
of judges. Put 


The maximum number of agreements, occurring if ( >) cells each contain m, 


\ 


is (3) (".) and thus in the case of complete agreement, and only in this case, 


u=1. The further we go from this case, as measured by agreements between 
pairs of observers, the smaller u becomes. The minimum number of agreements 
occurs when each cell contains m/2 if m is even or (m+ 1)/2 if m is odd. That 
is, if m is even, the minimum number of agreements is 


m 


2 = 4mom—2)(3), 


and in this case (9) 
m—1 


When m is odd the minimum value of wu is found to be 


14. We propose to call wu the coefficient of agreement. It is unity if and only 
if there is complete agreement in the comparisons. Its minimum value is not — 1 
except when m = 2. This, however, is to be expected in a measure of agreement 
for there can be no such thing as complete disagreement among three or more 
observers in paired comparisons. If observer P differs in certain corhparisons 
from observers Q and R, the two latter must agree on those comparisons. 
When m = 2, u reduces to 


n 

2 
and 2 becomes twice the number of cases in which the two observers agree 
about a comparison. wu is thus a generalization of a coefficient 7 proposed by 
Kendall (1938) to measure the correlation between two rankings. For general 


m, if the entries in the table were constrained to the ranking type, w would be 
the average intercorrelation 7 between observers taken two at a time. 


z= 3(?), 
% 
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“= 
2)\2 
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15. In discussing the significance of u it is desirable to know whether the 
set of preferences which give rise to it could have arisen by chance if the 
preferences had been assigned at random with respect to the quality under 
consideration. The procedure which first suggests itself is a generalization of 
the method used for the case of m rankings (Kendall & Babington Smith, 1939). 
That is to say, we sum the entries in the rows of the table and consider the 
variance of these entries. If the preferences are allotted at random we expect 
to find about equal numbers given to each object, and the variance will be low; 
in other cases it will be higher. 

The difficulty about this suggestion is that it has not been found possible 


to ascertain the distribution of the variance in the gn(2) possible sets of pre- 
ferences. The case m = 1, corresponding to the distribution of d for inconsistences, 
is difficult enough to solve. For higher values of m we have failed to find any 
distributions except in trivial cases. 

We can, however, offer a test based on the distribution of u (or 2). The 
comparative simplicity of the distributions in this case is in accordance with the 
remark made by Kendall in the paper under reference that the distribution 
of 7 is much simpler and much more regular than the distribution of the 
Spearman correlation coefficient p. 

16. Consider one cell in the table in row X and column Y and let it contain 
the number y. Then the corresponding cell in row Y and column X will contain 
m—y. Thus these two contribute to Y the amount (3) +("57) , 

Now, of the total ways in which the units can be distributed in the first cell 


there will be (") in which y units occur. Consequently the distribution of 2 


in the cell and the corresponding cell is given by the expression 


and since the distribution in other pairs of cells is independent if the preferences 
are allotted at random the distribution of X for the whole table is given by 


D(z) 
wher v=(3) 
rhere 


17. The distributions have been worked out for the following values of m 
and n:m=3,n =2 to 8;m=4,n =2to6;m=5,n=2 to 5; m= 6, n = 2 to 4. 
Tables III to VI give the probabilities based on these distributions, i.e. the 
probabilities that a given value of 2 will be attained or exceeded. 

For constant n the distribution tends to the Type III form as m tends to 
infinity. In fact, for a single pair of related cells the variate value corresponding 


22-2 


. 
‘ 
|, 


TABLE III 


The probability P that a value of E will be attained or exceeded, for m = 3, n= 2 to 8 


n=2 n=3 n=4 n=5 n=6 n=7 n=8 
P z z P z P P P 2 P 
1-000 3 1-000 6 | 1-000 10 | 1-000 15 | 1-000 21 1-000 28 | 1-000 
250 5 578 8 +822 12 17 -987 23 -998 30 1-000 
7 156 10 -466 14 *756 19 -920 25 -981 32 -997 
9 016 12 169 16 474 21 27 925 34 -983 
14 18 +224 23 29 -808 36 
16 | 20 -078 25 314 31 38 
18 0824 22 -020 27 148 33 -433 40 “736 
24 0035 | 29 057 35 +256 42 -572 
26 31 37 +130 44 -400 
28 -0430 33 0042 | 39 -056 46 250 
30 35 -0879 41 021 48 +138 
37 0712 43 0064 | 50 -068 
39 0712 45 0017 | 52 -029 
41 47 54 ‘O11 
43 0743 49 -0°68 56 -0038 
. 45 51 -0*10 58 0011 
53 “012 | 60 
55 -0812 62 -0°66 
57 0886 64 0413 
59 0°44 66 0522 
61 OM15 | 68 
63 023} 70 | -0740 
72 
74 
76 | -0M24 
78 
80 | -0148 
82 | 
84 
TABLE IV 
The probability P that a value of & will be attained or exceeded, for m = 4 and n = 2 to 6 
(for n = 6 only values beyond the 1°, point are given) 
n=2 n=3 n=4 n=5 n=5 n=6 n=6 | 
| 
: “= 6 | 1-000 12 | 1-000 20 | 1-000 42 | -0048 57 014 79 0842 
7 *947 13 “997 21 1-000 43 -0030 58 -0092 80 -0828 
125 8 14 “975 22 44 “0017 59 81 | -0°98 
9 455 15 901 23 *995 45 0°73 60 0037 82 0°15 
10 -330 16 24 ‘979 46 0°41 61 83 0°12 
ll 277 17 632 25 47 62 0013 84 
12 +137 18 524 26 48 090 63 86 0430 
14 043 19 410 27 805 49 0437 64 0°44 87 0117 
15 025 20 28 ‘719 50 65 0°23 90 04528 
18 ‘0020 | 21 185 29 621 51 0°93 66 0°13 
22 “137 30 52 0521 67 0*72 
23 053 31 413 53 “0517 68 0436 
24 044 32 +327 54 0°74 69 0118 
25 027 33 +249 56 0°66 70 
26 ‘019 34 179 57 -0738 71 0°47 
27 ‘0079 | 35 127 60 0°93 72 0520 
28 0030 | 36 090 73 0°10 
29 0025 | 37 -060 74 0°51 
30 0011 | 38 038 75 
32 0°16 39 024 76 0°78 
33 0495 40 016 77 0744 
36 0538 41 -0088 78 0715 


ty 
4 
1 
3 
4 
1 
4 
wee, 
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TABLE V 


The probability P that a value of E will be attained on exceeded, 


for m=5 and n=2t05 


a=2 n=4 n=5 
Zz P P z= P 
4 1-000 12 1-000 24 1-000 40 1-000 76 -0450 
6 375 14 -756 26 -940 42 -991 78 0416 
10 063 16 +390 28 -762 44 *945 80 -0°50 
18 +207 30 -538 46 *843 82 
20 +103 32 +353 48 -698 84 -0839 
22 34 -208 50 -537 86 -0810 
24 “O11 36 -107 52 +384 88 -0723 
26 0039 38 -053 54 +254 90 -0853 
30 0°24 40 C24 56 92 
42 -0093 58 -092 94 0°14 
44 -0036 60 -050 96 +0946 
46 “0012 62 100 0'29] 
| 48 64 
50 “0712 66 -0057 
52 -0428 68 -0025 
56 “0818 72 -0°39 
60 -0760 74 “0314 
TABLE VI 


The probability P that a value of X will be attained or exceeded, 


for m =6 and n = 2 to 4 


P 
‘000 | 
‘000 | 
‘997 | 
983 | 
945 
‘865 
-572 
-400 
-250 
-138 | 
-068 
029 
‘O11 
-0038 ‘ 
0011 
0°66 
0413 
-0522 | 
0°32 
0°40 
-0°36 
1924 
| 
01348 | 
412 | 
| 
| 
n=2 | n=4 
+ | | 
| | 6 1000 | 18 1-000 36 1-000 55 74 -0*12 
ee ee 688 19 969 | 37 999 | 56 029 | 75 | -0589 | 
10 219 20 38 ‘991 57 | -020 76 
15 O31 | 21 626 39 959 58 016 77 
0842 | | 92 523 40 -896 59 | 80 0°68 
‘0828 23 -468 41 “822 60 0072 | 81 0°17 | 
0°98 | 24 -303 42 ‘755 61 0049 | 82 0°12 
0°15 26 “180 43 669 | 62 00384 | 85 | ‘0734 
0°12 | 27 147 44 556 63 0025 | 90 0°93 
0195] | 28 ‘088 45 466 | 64 0016 | 
030 | 29 | -061 46 409 65 0°83 
0117 30 040 47 ‘337 (66 0°66 
01328 31 034 48 257 ‘| 67 048 
32 023 49 209 68 0°26 | 
35 0062 50 175 69 0°16 
36 01029 51 133 | 70 -0°86 1 
| 37 0020 | 52 097 71 0°68 | 
40 0°58 53 073 72 0448 “| 
| 45 0831 54 057 73 0°16 
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to a frequency (") is (™57)+(3). which is a quadratic in y. Were the 


variate value a linear function of y the distribution for the single cell would 
tend to normality in accordance with the well-known property of the binomial. 
The case of the quadratic value corresponds to a transformation of the variate 
of the type x? = y and the transform of the normal form exp (—?) dx becomes 
the Type III form exp(—y) y~*dy. Since the N cells are independent and the 
sum of variates in the same Type III form is also distributed in that form, it 


N 
follows that ¥ is in the limit distributed as exp (—2) 52 ‘d¥ except perhaps 
for some constants. Thus 2 or some multiple of it is distributed as y?. 

For constant m the distribution tends to normality with increasing n. 

18. The first of these results suggests that the Type III distribution will 
provide an approximation to the distribution (13) when m is moderately large. 
We proceed to find the first four moments of (13). 

It is sufficient to find the first four moments of (12), those of (13) being 
obtainable therefrom in virtue of the relationships which connect seminvariants 
of independent distributions. 

The rth moment of (12) about the origin is given by 


since 2” is the total frequency. Thus we have 
m*—m m m 
2%4,= 8 —mr + = 2" +8 (15) 
r=0\7 2 


Sums such as s| ‘ r? can be obtained by operating on the binomial (1 +2)” 


p times by Cg. we find 


Thus the mean of the distribution (13) is given by 


(16) 


i e 
Pi 
m m 
S = 2m — 
r 
and hence, substituting in (15), 
4 
(B 
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In a similar way we find 


(17) 


These are the moments of 2. Those of u are obtained by dividing by an 
appropriate power of x(3) and it may be noted in particular that the mean 
of u is zero. ; 


We have directly from (17), (18) and (19) 


8 (m—2)? 


iy" 


4 3N 
15m +17+—— m(m—1)}. 


For constant m, as N->oo, 


3 
and for constant N, as m->oo, 
8 12 


confirming the tendency towards the Type III distribution. 
19. The first four moments of the Type III distribution 


dF = ke-P? dx 


ppp’ pt 


2q 36 2 
are ) 


Equating the second and third moments to those given by (17) and (18) 
we find 


N -1 
2(m — 2)? 


2 


To make the first moments correspond we move the origin of the 2 dis- 


| 
| 
| 


340 On the Method of Paired Comparisons 


tribution a distance 3N (*) aS to the right. We thus reach the approximation 


to the 2 distribution, coinciding in the first three moments 


__22 Nm(m—1) 
dF = ke 2-1" dy, 
m\m—3 
where 2 = 5 
or, transforming to the more usual x? form by putting y? = 42:/(m— 2), we find 
that 
99 
is distributed as y? with 
“Nm(m-— 1) 
= ~(m—2)2 (23) 


degrees of freedom. 

The fourth moments of X and the yx? approximation differ by terms of order 
N-1 and m- compared with their absolute values. 

20. It only remains to be seen how large m and x must be for this to provide 
a satisfactory approximation. 

Consider first the distributions for m = 3. When n = 8, N = 28, we have, for 
the approximation, 42 distributed with 168 degrees of freedom. From Table III 
we see that for 2 = 54, P=0-011 and for 2 = 58, P=0-0011. Applying a 
continuity correction by deducting unity from 2 we find for the y? approximation 
with x? = 4x 53, v= 168, P= 0-011, and with y? = 4x57, P=0-00114. The 
correspondence is very close, in spite of the low value of m. 

For m = 4,n = 5, N = 10, the approximation gives 22 — 30 distributed with 
30 degrees of freedom. For 2 = 40 and 41, this gives, with continuity corrections 
of 0-5, half the variate-interval, y?= 49 and 51, y= 30. From the diagram 
given in Yule & Kendall’s “Introduction to the Theory of Statistics’’ (1937) 
it is seen that these values lie one on either side of the 1% value; and this is 
in accordance with the exact values of P, which are seen from Table IV to 
be 0-016 and 0-0088. Similarly we find that the values of 2, 37 and 38, lie on 
either side of the 5 °%, level, which is again in accordance with the exact values, 
P = 0-060 and 0-038. 

For m = 6, n = 4, N = 6, the approximation gives 2 — 33-75 distributed with 
11-25 degrees of freedom. For 2 = 59 and 60 the corresponding x? values are 
seen to lie on either side of the 1% point, which accords with the exact value 
of Table VI. 

We conclude that the x? approximation provides an adequate test of 
significance for the values of m and n outside the range for which Tables III-VI 
give exact values. 
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21. As a matter of theoretical interest we may record the results for the 
distribution of wu when the data are ranked. It appears that in this case 


12 2n+5 1 (2n + 5)? | 
m—22n?+6n+7 2 3(m — 2) 2n? + 6n4+7 
is distributed approximately as y? with 
m 2 n\ (2n+5)8 
(m— ») (Qn? 6n +7) degrees of freedom. 


This result is not of much practical value. The case of m rankings can be 
more simply treated by other methods. 


INTERPRETATION OF RESULTS OF PAIRED COMPARISONS 


22. In the light of the foregoing theory we may discuss the interpretation 
of the results of a paired comparison experiment. 

If for each observer the coefficient of consistence is unity the comparisons 
reduce to rankings and may be discussed by known methods. But if some 
or all the coefficients are not unity we have to consider the following 
possibilities : 

(a) Some of the observers may be bad judges and the inconsistences reflect 
their shortcomings in making comparisons. 

(b) Some of the objects may differ by amounts which fall below the threshold 
of distinguishability for some observers. 

(c) The property under judgment may not be a Sie variate at all and 
we may be getting the sort of confusion which would result if observers were 
asked to compare English towns according to the bivariate concept “ geographical 
position”. 

(d) Several of the effects may be operating simultaneously. 

23. If we have only one observer and have no prior knowledge of his 
capabilities it is not in general possible to apportion his inconsistences among 
these causes. Exceptions may occur when the inconsistences are of a marked 
and peculiar kind; for instance if they involve only four objects out of 15, we 
may suspect that the four are practically indistinguishable rather than that the 
observer is unable to make distinctions at all and avoided inconsistences among 
the others by sheer chance. But even here conclusions drawn a posteriori after 
inspection of the data are dubious. Table II gives a test of the hypothesis that 
an observer is incapable of making judgments. For example, with n = 7, the 
chances are 983 in 1000 that if the preferences are made at random there will 
be more than two inconsistent triads, so that if we find two or less, it is 
improbable that the observer is completely incapable of judgment. We might 
then be led to suppose that his small deviation from internal consistence is 
due to fluctuation of attention, very close resemblance to the objects giving 
rise to the inconsistences, or both. 
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24. With m observers the investigation can be taken a good deal further. 
If all the observers show inconsistences we suspect that the objects are at fault 
or that the observers are being asked to perform an impossible task. On the 
other hand, if most of the observers show a small or zero inconsistence we 
suspect that the others are just bad judges and may reject their data 
accordingly. 

As between indistinguishability of objects and non-linearity of variate, a 
choice of explanations would depend largely on the extent to which incon- 
sistences were concerned with the same set of objects. If there is a high value 
of u, indicating concordance of judgment, we expect to find most of the 
inconsistences confined to certain objects, and common to observers. In this 
case we suspect that the objects are close together in the degree to which they 
exhibit the quality under consideration. But if the observers scatter their 
inconsistences over the whole field % will be moderate or low and we suspect 
that the observers are being asked to do something beyond their capacity; and 
this brings us to question the validity of regarding the quality as a linear variable. 

25. When a quality such as “bravery” or “intelligence”’ is insusceptible 
to measurement there is frequently doubt of this kind. But this has not 
deterred investigators from assuming that such statistical variables exist, or 
from requiring observers to rank objects according to them, or in some cases 
from replacing such rankings by quantiles of the normal curve. We are never 
tired of criticizing this Principle of the Hypostasis of Plausible Terminology. 
Hitherto it has flourished largely because of the difficulty of adducing evidence 
against it; and we hope that the inconsistence of paired comparisons will provide 
a criterion, however rough, of the legitimacy of the methods to which it leads. 

But we would emphasize that our approach to the method of paired com- 
parisons has a somewhat different object from that elaborated by Thurstone 
(1927 and many subsequent papers). As we understand it, his method is appro- 
priate where one is entitled to assume a priort or by reason of precautions taken 
in the selecticn of material that a linear variable is involved and that there exist 
perceptible differences between the items presented for comparison. Our object 
is to make it possible to dispense with such assumptions and precautions. 

26. A few words may be added about the case in which an objective order 
is known to exist (as, for instance, in judging individuals according to age or 
weight). In such circumstances the appearance of inconsistences will indicate 
unreliability of the part of the observer or subliminal differences between 
objects. A measure of the observer's reliability may be obtained by calculating 
u between known and observed comparisons. If € is high enough to enable us 
to accept his judgments as internally consistent on the whole, uw may still be 
low enough to reject his judgments as accurate. 


27. We conclude with an example of the application of the foregoing theory 
to some experimental material. 
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Classes of children (ages 11 to 13 inclusive) were asked to state their 
preferences with respect to certain school subjects. Each child was given a 
sheet on which were written the possible pairs of subjects and asked to underline 
the one preferred in each case. Two classes gave the following results: 

(a) 21 boys, 13 school subjects. The preferences are shown in Table VII, 
which is in the form described in section, 13; e.g. there were 18 boys who 
preferred Art to Religion. 

TABLE VII 


Preferences of 21 boys in 13 subjects 


1. Woodwork — 14 20 15 15 16 16 18 18 18 2 21 20] 211 
2. Gymnastics 7— 412 13 18 4 16 16 W@W 16 18 19) 183 
3. Art WN 16 17. is 19 160 
5. History 6 8 710 — ll 12 4 15 13 16] 140 
6. Geography 14 14 13 13 #16 «215 #137 
7. Arithmetic 5 75 61H 7 PHBH B IU: 6 
9. English Literature 3 5 5 8 7 8 WW 9 — 10 13 13 15| 106 
10. Commercial subjects | 3 1 5 8 6 8 8 7 11 — 10 10 14 91 
| 
| Total 1638 


The calculation of 2 for this table, in which the objects are arranged in 
order of total number of preferences, may be shortened by noting that 2 as 
given by equation (7) may be transformed into the form 


E= (3). 


where the summation now takes place over the half of the table below the 
diagonal. Since the numbers in this half are smaller than those in the other 
half there is a considerable saving in arithmetic. 

We find 2 = 9718 


2x 9718 
and hence u= al 1 = 0-186. 


There is thus a certain amount of agreement among the children, indicated 
by the positive value of wu. Is this significant? 

We note first of all that this distribution of preferences could not have 
arisen by chance to any acceptable degree of probability. In fact, y? = 412-4 
(equation (22)) and y = 90-7. The large value of v justifies the use of the normal 
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approximation to the y? distribution and we find ,/(2x”) —,/(2v — 1) = 15-3 a very 
improbable result on the hypothesis of a random allocation of preferences. 
The distribution of circular triads was as follows: 


| wo. of triads Frequency No. of triads Frequency 


0 1 12 1 
17 3 
4 5 21 1 
6 2 25 l 
7 2 29 1 
8 1 39 1 
Total 21 


The total number of circular triads was 242 with a mean of 11-5. Only 
one boy was entirely consistent. On the other hand, for n = 13 the maximum 
number of circular triads is 91, with a mathematical expectation of 71-5. IJt is 
thus clear that, except perhaps for one boy, we cannot suppose that any boy 
allotted preferences at random. We are again led to conclude that the boys 
are genuinely capable of making distinctions, and that consistently on the 
whole. Half the boys have coefficients of consistence € greater than 0-92. 

We conclude that the boys can make preferences and that in their view the 
subjects are sufficiently different to enable a reasonably consistent set of 
preferences to be made. So far as these data are concerned we would see no 
objection to the assumption that a scale of preferences can be set up. With 
this in mind we can say that the value of wu indicates a certain amount of 
agreement, though not a strong one, between the boys as to which subjects 
they prefer. 

(b) 25 girls, 11 school subjects. Table VIII shows the data. 


TABLE VIII 
Preferences of 25 girls in 11 subjects 


1. Gymnastics — 10 19 17 20 17 21 21 21 18 22) 186 
2. Science 16 — 12 15.17 15 21 19 18 16 17) 165 
3. Art 6 13 — 16 16 18 10 17 16 19 16] 147 
4. Domestic Science 8 10 9 — 16 ll 13 15 14 11 «214;] «2121 
5. History 5 8 9 9 — 4 18 12 13 15 18] LI 
6. Arithmetic 8 10 7 14 11 — 12 13 12 16 18) 121 
7. Geography 4 4 15 12 7 13 — 14 15 14 14] 112 
8. English Literature 4 6 8 10 13 12 11 — 14 13 14 | 105 
10. Algebra 7 9 6 14 10 9 11 12 14 — 12] 104 
1l. English Grammar nH — 88 

Total | 1375 
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We find Y = 8928, u = 0-082. 

For the x? significance test, x? = 180-3, v = 62-4 and ,/(2x?) —/(2v—1) = 7-9, 
as before a very significant result. 

The distribution of circular triads was 


No. of triads | Frequency | No.of triads | Frequency 
1 2 17 1 
2 2 19 1 
3 1 22 1 
4 1 23 2 
6 1 27 1 
8 1 32 1 
9 1 35 1 

ll 2 37 1 
12 2 38 1 
13 1 
14 i Total 25 


The total number of circular triads is 382 with a mean 15-28. For n= 11 
the maximum number of circular triads is 55 with an expectation of 41-25. 
Several of the girls come very close to this, the worst having a coefficient of 
consistence equal to 0-31. 

We are, however, again led to conclude that the preferences were not 
allotted at random and that most of the girls are capable of exercising a judgment 
which is on the whole consistent. There is only a very slight agreement in 
preferences. 

Thus the girls are less consistent and less alike in preferences than the boys. 
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TRE MEAN AND VARIANCE OF ,?, WHEN USED AS 
A TEST OF HOMOGENEITY, WHEN EXPECTATIONS 
ARE SMALL 


By J. B. 8S. HALDANE, F.RS. 


Prarson’s measure of divergence, x?, can be used not only as a test of goodness 
of fit, but as a test of homogeneity when the expectations are unknown before 
the samples are observed. Consider the (m x n)-fold table: 


433 --- Agm | 
Any Ane ang Sn 

t, ty ts ee tn N 


Here each a,; represents a number of individuals observed, and s; = }a,;, 
j 


t; = N = Ys; = Lt;. The table may represent samples of 59, ..., 8, 

individuals, each sample falling into m classes, t,, t,, ..., t,, being the grand totals 

in each class. Or it may represent m samples of ¢,, ta, ..., t,,, individuals, each falling 


into n classes, the class totals being s,, 89, ..., s,. In each case we ask what is the 
probability of so bad a fit if every sample is taken from the same large population. 


it; 
The expected value of a,; is clearly Sis , and 


Fisher (1922) showed that when every s; and ¢; was sufficiently large, y? has 
the usual distribution, with (m—1)(n—1) degrees of freedom. Thus its mean is | 
(m—1)(n—1), its variance 2(m—1)(n—1). Haldane (1937, 1938, 1939) investi- 
gated the exact values of the moments of x? when expectations are small, in 
(m x n)-fold tables with mn or m(n— 1) degrees of freedom, but did not attempt 
the present problem. This had previously been done by Cochran (1936), in the 
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restricted case when m = 2, and every s; is equal. Cochran used a method based 
on the use of characteristic functions. My own method is entirely elementary, and 
completely exact, though rather tedious. My results are very close to Cochran’s, 
but the accurate values are worth putting on record. 


THE MEAN VALUE OF x” 
Given the marginal totals, the probability of any particular distribution is 
(s:!) 
v 


And this is so whatever may be the true expectations. Thus, if in the sample s; 
the expectation of the observation a,; is s;p;, so that that of t; is Np,, then the 
probability of the given distribution is 


il (s;!) 
But the probability of the given marginal totals ¢; is 
(p?) N! 


Hence P is the probability of obtaining any particular distribution with the given 
sample sizes s; and class totals t;. 


If E(x) denotes the sampling expectation of z, then E(a;;) = X Pa,;;,summation 
being taken over all samples possible with the given marginal totals. Hence 


E(a;;) = (N —1)!(@;;—1)! TT (@y!) 
kl 


N 


where k assumes all positive integral values between 0 and n except t, and / all 
positive integral values between 0 and m excep! j. That is to say the quantity 
summed is the probability of a set of observations similar to those of the table 
except that s; and t; have been diminished by unity. Hence the sum equals unity. 


8,8,;(t; — 1) 
= 
E(a;;4,.) = 


N(N-1)’ 


3 


348 The mean and variance of \” 


and so on. Hence 


2)—-y2—-NE 
K(x?)=X°=NE —N 


N 


= N?—nN—mN+mnN)—N 


(1) 


a result previously obtained by Bartlett (1937). 


This clearly becomes (m—1)(n—1) when N tends to infinity. In the case 
where m = 2, and every s; = s, we have 


—, (n—1)ns 
ns—s+1 
‘ 
Cochran gave 


This exceeds (2) by , and is therefore correct to 0(n-"). It will be noted 


that the mean Pet only on the number of degrees of freedom and the grand 
total N. The higher moments involve the marginal totals s; and ¢;, and are 
therefore more complicated. It is clear that the expectation is diminished because, 
in sampling from a finite group of NV individuals, E(a,;) has the same value as in 
sampling from an infinite population with the same frequency, but E(a},) is less. 


THE VARIANCE IN A (2X ”)-FOLD TABLE WITH EQUAL SAMPLES 


Owing to the complexity of the general expression for the variance, we shall 
first calculate it for the (2 x )-fold table: 


A 


1 
dX (8,4; 8; t;+N) N 
: 
Po 
ag 
. 
4 
2 
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Here n samples of s are taken, and each falls into two classes, say a; healthy 


and b; diseased. The totals are A and B, and N = ns = A+B. It will be con- 


venient to put A = pN, B= qN. 


1 
x ene (sp—a,)? 


Pq 
spq 
nps\? 1 
Now (> a?)? = Dat+2 aia}, 
i 


summation being taken over all unequal pairs of values of ¢ and j. 
Hence 


E(2a?)? = nE(a}) + n(n — 1) 
= nEf{a,(a;— 1) (a;— 2) (a;— 3) + 6a,(a; — 1) (@; — 2) + — 1) 
+n(n—1) 1) + 2a,(a;— 1) a; +a;a,] 


A(A—1)(A—2)(A-3 


A(A-1)(A-! 
+ [6ns(s— 1) (s— 2) + 2n(n— 1) 8%(s—1)] 


+ 1)+n(n— 1) ns. 


Since NV = ns, A = pns, we have 


l _ (4—1)(4—2)(4-3) 
= (N—1)(N—2)(N—3) 1) (m+ 6] 


2(A—1)(A—2) 


(NW —1)(N—-2) (s— 1) [(n+2)s—6] 


[m+ 6)s—7] +1, 


E(X ai)? = (ns — 1) (ns — 2) (ns — 3) 


— 1) {ns®— (n+ 4) 8+ 6} 


+ 2p?n(n — 1) s(s— 1) (ns—6) 


+ p(n —1) {n(n + 1) — 2(3n — 4) s— 2} — 2(n— 1) (s— 
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So the variance of x? is given by 


= [(n—1)ns pns |? 


p*q?s* (ms—1) q 


_ E(X az)? n®s*{pn(s— 1) +n—1}* 


+ 2p?n(n — 1) 8(s— 1) (ns — 6) 
+ p(n — 1) (nn + 1s? — 6n — 8s — 2) — 2(n— 1) (s—1)} 
— ps(ns —- 2) (ns — 3) {pn(s—1)+n—1}*] 


2n*(n — 1) 8(s — 1) (pqn*s? — ns + 1) 
pq(ns — 1)? (ns — 2) (ns — 3) 


_ 2n4(n— 1) 8*(s— 1) ns—1 


_ NN —n) 1 N-1 


This becomes 2(n—1) when A and B tend to infinity. 
When A and B are both of order N, that is to say neither p nor q is small, 


it becomes 
2(n— | -(n+—-7)N- | -2), 
(n—1)} 1 (n+ +0(N-*) 


Cochran’s expression is 2(n—1)(1—nN-1)+0(N-*), which is accurate when 


p=}t - , i.e. 0-8273 or 0-1727. It will be seen that the variance is diminished 


when n>7 which is certainly the case if n> 3. If A remains finite when NV 
and B tend to infinity we find y? = n—1, 


V(x?) = (1-4), 


a formula already given by Haldane (1937). 
Formula (3) may be compared with the values given by Haldane (1937) for 


& 
| 
(3) 
A 
| 
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a (n x 2)-fold table with n degrees of freedom. In the terminology of this paper 


they are: =) 
y 


= m+ 
V(x?) 6). 
Similarly the limiting case given above may be compared with the values for 
a n-fold table with n degrees of freedom, namely, 
=n, 


n 
V(x*) = 


THE VARIANCE FOR A (m X n)-FOLD TABLE 


This of course includes the variance found above as a special case. However, 
as the summations involved are somewhat hard to follow, the simpler algebra 
for the special case has been given separately. 


a’. 
x = NW »(52) 
it; 
a2, 
8,t; 
Here summation over the values of 7, k on the one hand, and j, / on the other, are 
independent and commutative. }(x) = =[=@)] , where k assumes all values 
ik ilk 
from o to n inclusive, except 7. (x) has a similar meaning. 
jl 
Thus = n, ¥(s;) = N, =(1)] = 2%, = nN, and so on. 
i i i k i k 
Let S= T= 
Then 
at, = 1) — 2) (@,; — 3) + — 1) — 2) + — 1) 
= — 1) — 1) + — 2) Ag; +. — 1) +055 


and so on. The expectations of the two middle terms in the last expression are of 
course equal. Remembering that 


2) (s;—3)t,(t; — 1) (t; —2) (,-3 
Blaglayg—1) (4-2) = 


— 1)t,(t; — 1) (t; — 2) (t; -3 


23-2 


| 
| 
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and so on, we have 
N~*E[(x? + N)?] 


~ 8;tj 
iki t; 
ijl 8; ikjl 


1 (8; — 1) (8;— 2) — 2) 


(s:— 1) (4-1) ¢;—2) 
j 


(s;— 1) (8; — 2) (t;- 1) 


+ 2> +2> 
ikj iil 


+23 
Hence 
NE[(x? + N)?] 


l 
a wow — 68; +11 — 687%) + 1) 


64411-6794 
2 
(W-2) 3+ 287") —3+ 274) 


+ 3+ 2571) + 342672) + | 


1 


~ (N=1)(N-2)(N— 


x [N? — 2(m +2) N +m?+ 


x [2(N — 3n + 28) (N —3m +27) + (nN —n?— 2n + 28) (mN —m?— 2m + 2T)] 


(m—T) + (n?—S) (m2— T)] + ST. 


Ly 
7 
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Hence 
—1)(N —2)(N —3) E[(x?+ N)*] 

= [N2—2(n +2) N +n2+ 10n—6S] [N2—2(m +2) N + m2+4 10m—6T] 
+ 2(N — 3) [(nm +2) N2 
—{nm(n +m) + 4nm + 6(n + m) — 2(n + 2) T — 2(m+ 2) SIN 
+n?m? + 2nm(n +m) + 22nm — 2(n? + 8n) T — 2(m? + 8m) + 128T) 
+(N—2)(N —3) [n2m2?+ 6nm — (n? + 6n) T — (m2 + 6m) S+ 7ST] 
+(N—1)(N—2)(N—3) ST 

= N44 2(a—f—2) 66+ 4] N? 
+(—3a?+ 8af — + 6a—4f) N + a? 4a—[(n?+ 2n—2)T 
+ (m? + 2m — 2) 8S] N? + [(n?— 2n) 7' + (m?— 2m) 8S] N+ N*(N +1) ST, 


where a = nm, 8 = n+m. 


But = 
_ N(N+a-A) 
Hence 


— 1)? (N — 2) (NW —3) V(x”) 
= (N—1)[N*4+ 2(a—f— 2) N*+ete.]— N(N 
= 2(a—f+1) N34 (a? + 28-4) 
— a(x — 28+ 4)—[(n?+ 2n—2) T + + 2m— 2) S] 
+ [(n? — 2n) T + (m2? — 2m) S] N(N —1)+ STN*(N2-1). 
Thus 


N 
1) — 2) (N—3) 


2(n — 1) (m— 1) + (n?m? + 2n + 2m — 4) N? 
— 2{nm(n — 1) (m— 1) + (n+ m) (n+ m— 2)} N —nm(n— 2) (m—2) 
— {(n? + 2n—2) T+ (m?+2m—2)S}N2 
+ {n(n — 2) T + m(m — 2) S} N+ STN*(N + 1), 


where S and 7' have been defined on p. 351 above. 
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When there are n samples of s,, 33, 83, ..., 8,, members, falling into two classes 
whose totals are A and B, we have m = 2, T = a, § = 2Zs;', 


+1) —(n2+2n—2) N+n(n—- 2 |. 


, it becomes 
VO" = | 
. N3(N +1) 
Mle (a+b) (a+c)(b+d) |" ...(6) 


DiscussIOoN 


The expressions for the higher moments would clearly be a great deal more 
complicated. The above calculations have, I think, a twofold interest. They show 
that the loss of one degree of freedom arises from the fact that we are sampling 
from a finite, and not an infinite, aggregate. And they point the way towards 
an exact treatment of the problein of curve fitting, for which Pearson originally 
designed the measure x”. In the case of a (n x 2)-fold table with (n— 1) degrees 
of freedom we are, in effect, asking whether the observations give a satisfactory 
fit to a horizontal straight line y = k, where y is the observed frequency of a type 
within a sample. If we were trying to fit a line y = k + lr, we should have n—2 
degrees of freedom, x having a different value for each sample. If we were trying 
to fit a normal curve we should have n—3 degrees, and so on. Jeffreys (1938) 
has pointed out the great difficulties of curve-fitting in such cases when expecta- 
tions are small. It is clear that the expected value of y? in such a case is not 
exactly n—3.° It may turn out to be slightly greater. Thus an investigation of 
the actual law of error ina particular type of observation will demand an extension 


of the present investigation to cases where several more degrees of freedom are 
lost. 


‘ 
72 NU 
oa When every s; = s, this reduces to formula (3). In the case of the fourfold 
“ : 
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A NOTE ON THE STATISTICAL ANALYSIS OF SENTENCE- 
LENGTH AS A CRITERION OF LITERARY STYLE 


By C. B. WILLIAMS, Sc.D. 
Department of Entomology, Rothamsted Experimental Station 


Some years ago I made a number of calculations of the frequency distribution 
of words of different length in different books to see to what extent authors 
kept to a definite distribution and so perhaps might be identified by such a 
method. The results obtained, however, were not striking and the work was put 
at one side. . 

Mr Udny Yule (1939), however, has attacked the problem of authorship 
from the angle of the variation in sentence length, and this appears to be a 
much more fertile method of approach. 

Mr Yule shows that the frequency distribution of sentence length (i.e. 
number of words between successive full stops) is of the skew type and by 
comparing in two different manuscripts, the mean, the median, quartiles and 
deciles he is able to produce convincing mathematical evidence on the identity 
or otherwise of their authorship. 

Mr Yule does not comment on the skew distribution further than to state 
(p. 371) “they are not of the Poisson type, but of the type in which the square 
of the standard deviation largely exceeds ihe mean”’. 

When I converted some of Yule’s tables into diagrams I was struck by their 
general resemblance to certain skew distributions with which I have recently 
been dealing in some Entomological problems, and which distributions, I found, 
became normal and symmetrical if the logarithm of the number was taken as 
a basis for subdivision into groups instead of the number itself (see Williams, 
1927). 

| was unable to test this transformation on Yule’s figures as he unfortunately 
does not give the original data, but only the word length of sentences in groups 
of five; so it was necessary to obtain some new data. 

These [ obtained by counting the number of words in each of 600 sentences 
from the following three books: 

(1) G. K. Chesterton, A Short History of England, 1917. 

(2) H. G. Wells, The Work, Wealth and Happiness of Mankind. 

(3) G. Bernard Shaw, An Intelligent Woman’s Guide to Socialism. 

All three works deal with the exposition of somewhat similar sociological 
subjects and none of theim are in the “conversational” style. 

The selection of the sentences was randomized as follows. Each of the books 
is divided up into chapters, sections or both, In Chesterton’s book the first 30 
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sentences were counted in each of the first 20 chapters. In Wells’s book the 
first 10 sentences were counted in each chapter subdivision up to chapter vil, 
division 11. In Shaw’s book the first 15 sentences in each of sections 1-40 were 
taken. In each case the greater part of the book was covered. 

The original data thus obtained are shown diagrammatically in Fig. 1. 
Each of the distributions is of the typical skew type obtained by Yule: Shaw is 
the most extreme and varies from 3 words to 143; Wells is less skew and ranges 
from 3 to 91; while the Chesterton curve is the least skew and varies from 5 to 91 
with only two values over 60. 

From Table I it will be seen that the arithmetic mean number of words per 
sentence is 25-87 for Chesterton, 24-11 for Wells and 31-23 for Shaw. The 
medians are also different and presumably the quartiles and deciles, but these 
latter were not calculated. - 


TABLE I 
Frequency constants of the distributions of sentence length 
Chesterton Wells Shaw 

| 
Number of sentences : 600 600 600 
Number of words 15,521 14,463 18,735 
Arithmetic mean no. of words 25-87 24-11 31-23 
Median no. of words 25-3 20-8 ; 26-0 
Mean log no. of words 1-37 1-31 1:39 
Geometrical mean no. of words 23°5 20-5 24-5 
Standard deviation of mean log 0-200 0-237 6-290 
Standard error of mean log 0-0080 | 0-0095 0-0112 


If, however, instead of taking the frequency distribution of the actual 
number of words per sentence we take that of the logarithm of the number we 
get the distributions shown in Figs. 2-4. They undoubtedly show a very close 
resemblance to the “normal distribution’. The mean log and standard deviation 
for Chesterton is 1-37+ 0-20; for Wells 1-31+0-24 and for Shaw 1-:39+0-29. 
The standard error of the mean is, owing to the large number of observations, 
in all cases very small and approximately + 0-01. 

On each of the three figures is superimposed a normal curve of the same 
area, mean and standard deviation and it will be seen how closely it fits the 
observed values. 

The following comments may, however, be made: 

(1) The greater irregularity of the observed values in the lower portion of 
the distribution is due to the irregular distribution of the logarithms of integers 
when grouped in small artificial divisions as in the present case. Thus there is no 
logarithm of an integer between 0-01 and 0-25; none between 0-61 and 0-65 
and between 0-71 and 0-75. On the other hand, there are two between 1-11 
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and 1-15; two between |-16 and 1-20; only one between 1-21 and 1-25 and three 
between 1-26 and 1-30. Thus in all three diagrams the frequency of 1-21—1-25 
is well down and that of 1-26—1-30 is far up. Some process of smoothing would 
undoubtedly eliminate these irregularities, but it was thought better to leave the 
data in their original form and draw attention to the sources of irregularity. 

(2) There appears to be on all three diagrams a slight shortage of high values 
and a slight excess of low ones. On the latter I have no comment but it appears 
possible that the small deficit in the longer sentences might easily be due to a 
biased effect introduced by the habit that many writers have of cutting up 
unusually long sentences into their component parts when reading over their 
manuscript or proofs. 

On the assumption that the normal curve is a sound representation of the 
frequency distribution of the log number of words in sentences, Fig. 5 has been 
prepared which shows the means and normal curves for the three books super- 
imposed on one another. The means are close together but the distributions are 
very different. 

The difference in means between Shaw and Wells is 0-09 and the standard 
error of the difference is only 0-015. Thus the difference is six times the standard 
error and hence certainly significant. Between Shaw and Chesterton the 
difference of the means is barely significant but that between the standard 
deviations is quite striking. ; 

If the above reasoning is correct, it is unnecessary for the comparison 
between two documents to compare arithmetic means, medians, quartiles and 
deciles, but only the log mean and the log standard deviation; all other com- 
parisons are included in these. 

It follows also that Mr Bernard Shaw, while undoubtedly under the im- 
pression that he was punctuating at his own free will, was for this particular 
book hide-bound within the limits of 

(1-4-2)? 
= Jam P| | 
while similarly Mr Wells was writing under the restricting influence of 

0-24 /(277) 2(0-24)2 | 
where Z is the frequency and x the logarithm of the number of words per 
sentence. 


It is also perhaps worthy of passing comment that the curve representing 
Mr Shaw is short and broad while that representing Mr Chesterton is tall and 
slender; which shows how necessary it is to use these curves only for the purpose 
for which they were originally designed. 

Perhaps something might be added on the meaning in words of the above 
mathematical transformation. If the log distribution is normal we can infer 
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that the extent to which a sentence in the process of writing is likely to vary is 
at any level proportional to the length of the sentence. Thus when he is thinking 
in short sentences of about 10 words an author is likely to vary say from 8 to 
12 words; when he is thinking in longer sentences of say 100 words he will vary 
from 80 to 120. In other words the variations are proportional or geometric 
and do not merely involve the addition or subtraction of x words at all levels. 
Further, if the geometric mean is taken as a basis, sentences between this and 
half its length are as frequent as those between it and twice its length ; sentences 
down to one-quarter its length are as likely to occur as sentences up to four 
times its length; and so on. 

If the arithmetic mean were the true basis then sentences of 10 words more 
than the arithmetic mean would be as likely to occur as sentences of 10 words 
less, and it is easy to see that this is not the case. 

Before the whole theory of the use of such distributions for separating works 
of different authorships can be fully accepted it will of course be necessary to 
study the results obtained from many different works by the same author, in 
different styles, on different subjects and at different periods of his life. From 
these it may be possible to find what variation can occur “within authors” as 
compared with “between authors”. This note is not meant to deal with this 
basic problem but only to draw attention to the simplification of the method 
of approach to such a problem by the use of a transformation which produces 
a normal instead of a skew distribution. 
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APPLICATIONS OF THE NON-CENTRAL t-DISTRIBUTION 


By N. L. JOHNSON anp B. L. WELCH 


1. INTRODUCTION 


STATISTICAL problems arising in connexion with the normal distribution are 
simplified by the circumstance that the sample mean Z and standard deviation s* 
are jointly sufficient estimates of the corresponding population parameters £ and 
o. Also the distributions of Z and s have simple forms and together with Student’s 
t-distribution provide complete solutions of any problems of testing hypotheses 
or of estimating fiducial limits relating to either £ or o singly. In the present paper 
we shall consider some of the questions which arise when our main concern is not 
with either € or o alone, but with some function of the two. In particular we shall 
consider a number of cases which all lead to the use of what has been called the 
non-central t-distribution. Tables of the probability integral of this distribution 
will be given in a form suitable for the solution of the problems raised. 


2. THE NON-CENTRAL f-DISTRIBUTION 
Let z be a quantity distributed normally about zero with unit standard 


deviation and let w be a quantity distributed independently as x?/f, where f is the 
number of degrees of freedom of the y?. Then if ¢ is defined by the equation 


8 
(1) 


where é is some constant, then ¢ is distributed in a manner depending only on 6 
and f. This distribution will be termed a non-central t-distribution. When d equals 
zero we have the familiar Student’s t. 


In general the elementary probability distribution of ¢ is given by 
(Sf) f+t 0 
© yf 
where Hh, (x) = (3) 
OJ: 
An account of some of the properties of the above-defined Hh function has been 


given by R. A. Fisher (1931). He derives a result equivalent to the above equation 
(2) although with a different notation, his ¢ being equivalent to our t/,/f and his 


* 3? = 3(x—Z)*/(n—1), where n is the sample size. 
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T being our 4/,/(f + 1). Tables of the Hh function have been calculated by J. R. Airey 
(1931). These tables, however, are not useful for solving the problems which are 
considered in the present paper. For these problems the probability integral 
of the non-central ¢-distribution is required. Existent tables of this integral 
(J. Neyman, 1935 and J. Neyman & B. Tokarska, 1936) have been calculated only 
for one rather restricted purpose. In the present paper much more extensive 
tables will be provided which, it is hoped, will cover all the applications of the 
non-central t-distribution likely to be encountered. 

The probability integral of non-central ¢ demands a table of triple entry, since 
the probability that t exceeds to, say, depends on f, 6, and f,). The notations 


will be used to denote this probability. 
Often it is necessary to find what value of é, is such that the probability (4) 


will take a specified value €, say. This t, will be a function of f, 6, and € and it will 
be convenient to denote it by ¢(f,6,¢). Thus 


P{t>Uf,8,€) |f,8} =. 


Again, often f, will be given and the value of 6 which makes (4) take the value e, 
will be required. It will be convenient to denote this value of é by 4(f, to, €). Thus 


P{t>ty | f, O(f, to, €)} = €. 


Space does not permit the tabling of all the three functions P(f, 6, t,), t(f, d,€) 
and 4(f, to, €). For reasons which will become clear later in the paper it was decided 
to table 4(f, tp, €) most fully. Table IV, given at the end of the paper, facilitates the 
direct calculation of d(f,t),¢) for seventeen probability levels ¢. It can also be 
used without difficulty for calculating t(f, 6, €). Table V is an additional short table 
from which ¢(f,d,¢€) can be calculated rather more directly but only for the 
probability levels ¢ = 0-05 and € = 0-95. These tables are given in the most suitable 
form which we have been able to evolve consistent with necessary limitations on 
size. 

Before the tables are described, the next three sections wil! be devoted to a 
description of some of the situations where they are required. 


3. COEFFICIENT OF VARIATION 


The first function of § and o to be discussed will be the coefficient of variation 
V=o/€. In the practical situations where this index is an appropriate measure 
of variability, the variable x is usually necessarily positive. Now for a normal 
population the ratio of the mean to the standard deviation has to be of the order 
of 3 or more, for the chance of a negative x to be negligible. Strictly speaking, 
therefore, the sampled population should not be assumed normal if the coefficient 
of variation is too great. The figure 33 % has often been stated as the permissible 
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upper limit. In practice coefficients of variation are usually much smaller than 
this, and it is assumed in the present section that we are dealing with such cases. 

An estimate of V is provided by the sample coefficient of variation v = s/Z. 
Now since we may write 


v 8 | o o (5) 


it appears from comparison with (1) that ,/n/v is distributed as non-central ¢ with 
f = (n-1) and 6 = ,/n/V (McKay, 1932). The solution of problems relating to V 
is therefore easily effected. 

For instance, suppose it is desired to test whether the sample contradicts the 
hypothesis that V < V,, and that we decide to reject the hypothesis when v > vy, 
where v, is chosen so that P(v > v9 | V = V,) is equal to some specified small chance 
e. In the notation of the previous section v, will then be given by 


_i 


Again, consider an example where it is decided to reject a sample as unsatis- 
factory when v is greater than a given value vp, and where it is required to know 
how low the true coefficient of variability should be kept to ensure that the 
probability of rejection will not exceed a given e¢, i.e. we require Vy such that 
P(v>v.|V=V,) =e. In the notation of Section 2, V, is given by 


0 0 


Or finally suppose that a value of v is observed and an upper fiducial limit of 
V is required so that the chance is ¢ of this limit being exceeded, i.e. a lower 
fiducial limit of ,/n/V is required. Now since 


v V 
and since the inequality > -1, 
is equivalent to the inequality* 
1, 
V v 
it is seen that the required upper fiducial limit of V is 


* This follows from the fact that ¢(f, 6, €) is a monotonically increasing function of 6. 
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4. THE POWER OF STUDENT’S f-TEST 


Suppose it is desired to test the hypothesis H, that the mean £ of a normal 
population has the value £,. Student’s t-test consists in calculating from the data 
th tit: 

8 


and referring it to the usual central t-distribution with f = (n—1) degrees of 
freedom. Thus if the only alternative hypotheses to be taken into consideration 
are those for which £ > £5, the test will consist in rejecting the hypothesis when 
t>to, where t) is such that P(t>t,|H,)) =, and ¢€ is the conventional level of 
significance. In our notation ; 

(10) 


Now when H, is not true, but £ has some alternative value é,, it is often required 
to know how powerful Student’s test will be, i.e. what chance the test will have of 
rejecting Hy. But we can write (9) in the form 


= 


whence, comparing with (1), it now appears that the quantity calculated from the 
data is distributed in the non-central ¢-distribution with f = (n—1) and 


= Jn | 
The power of the test is therefore P(n—1,,/n (€,—&)/o,t9) and the value of £, 
for which the power reaches any specified value 9, say, is given by 

Tables for evaluating the power of the t-test together with a discussion of their use 
have been given by J. Neyman (1935) and J. Neyman & B. Tokarska (1936). 
They are not restricted to the simple case of testing whether a mean has a 
specified value but apply to all cases in which the t-test is used. 


5. PROPORTION DEFECTIVE PROBLEMS 


Another class of problem where the non-central ¢-distribution has an applica- 
tion, occurs when objects are classified as defective or non-defective according to 
whether they have values of a characteristic exceeding or falling short of a fixed 
standard. Thus, if an object is defective when the character x exceeds a fixed given 
level L, then information will often be required about the proportion P falling 
beyond L in the population. If the population be normal P clearly depends only 
on the ratio 
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An estimate of U is provided from the sample by calculating 


_ (L-2%) 


Corresponding to deviate u the equivalent normal probability p will then give 
an estimate of P. Chis situation has been discussed recently by W. J. Jennett & 
B. L. Welch (1939).* The transference from U to P and frorh u to p does of course 
require that the sampled distribution is really approximately normal and for 
this reason care must be taken in going out to the “tails” of the distribution 
(i.e. when U is large and P small). 

In order to allow for the sampling errors arising from this method of estimating 
P we have only to note that (13) may be written 


Hence ,/nw is distributed as non-central t with f = (n—1) and 6 = Jn U. Thus if 
the proportion P (and hence U) were known the value wu, of wu such that P(u > u,) =€ 
would be given by 

(15) 


Conversely, given u from a sample, a lower fiducial limit of U is obtained by 
noting that the inequality pete 

Jnu >t(n—1,,./n U,€) 
is equivalent to the inequality 

(16) 


Hence a lower fiducial limit for U will be given by 6(n— 1, ./nw,e)/,/n. In the above 
analysis the proportion falling beyond L has been termed a “proportion de- 
fective”’. In industrial problems this is often a fair description. More generally 
the analysis refers to any problem where we are primarily interested in how a 
measurable object is related to a fixed level. A slightly different problem occurs 
when we are not given L but are asked to estimate the value of Z so that P will 
have an assigned value. For instance, the value of Z may be required so that the 
chance is 1 in 20 of it being exceeded. 

Now if U is the normal deviate which is exceeded with probability P, then the 
required L is related to and o by the relation 


L=£+U6e. 
An estimate of L is therefore given by 
(17) 
* For a general discussion of similar problems arising in the control of industrial products the 


reader may be referred to the British Standards Institution Publication, No. 600 (E. S. Pearson, 
1935). 
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If fiducial limits are required for L they may be obtained by returning to equations 
(13) and (15). Z is not now given, but U is definitely known and therefore (15) 
provides the value of wu, such that P(w>u9) = ¢. But the inequality 


L-=% 
u= > Uy 
can be reversed to read L>%+ugs. 
Hence, if we write l= (18) 


we shall have P(L>1,) = ¢«. Hence l, will be a fiducial limit for L, which will be 
exceeded by L in a proportion € of cases. 


6. LARGE SAMPLE RESULTS 
Before going on to discuss the distribution of non-central ¢ in general and the 
method of applying the tables given below, it will be convenient to consider the 


situation when f is large. The distribution then approaches the normal form. The 
important question is how quickly. 


The first three moments of ¢ are given exactly by the following expressions: 


r(f} 


(19) 
f 
r(3) 
2 
f(2f-—3+6?) 


If the gamma functions are expanded in powers of 1/f and only the leading 
terms are retained, these give approximately 
af | 
4f 


In large samples, therefore, ¢ becomes normally distributed about 6 with standard 
deviation (1 5) . The rapidity of the approach to symmetry is indicated by the 


true values of ,/f, in Table I. For given f the greatest value \/f, can take is shown 
in the last column. 


* The term é*/fis retained because in most problems é will be of the order /f. 
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TABLE I 
NB, of t for different f and 6 
0 1 2 3 fo 
f 
4 0-00 3°70 4-61 4-86 5-09 
6 0-00 1-67 2-13 2-26 2-38 
8 0-00 1-20 1-55 1-65 1-74 
12 0-00 0-84 1-10 1-18 1-25 
24 0-00 0-53 0-69 0-74 0-79 
foe) 0-00 0-00 0-00 0-00 0-00 


It is seen from Table I that values of ,/f, of the order of unity are likely to 
occur fairly frequently in the type of problem which has been discussed in the 
previous sections. This represents a considerable degree of skewness and another 
method of making normal approximations, outlined below, is preferable. Before 
describing this we may note that if ¢ is referred to a normal scale with mean é 


2\3 
and standard deviation (1 +5) then the following approximations to the 


quantities defined in section 2 will be obtained: 


where K, is the deviate of the unit normal curve exceeded with a probability e. 
Also 4(f, to, €) will be given by the solution of the equation in 6 


ty = 8+ K, (145). 


On rationalization this equation is quadratic in 6 and the correct root to take will 
be obvious. 

Better approximations, which are, however, still based on the normal curve 
may be obtained as follows. The probability that ¢ exceeds a given value f, is 
by (1) the probability of the inequality 


_ 
t= (25) 
and this inequality is equivalent to 
(26) 


Now (—z) is a unit normal deviate, and ./w, being of the form y/,/f is very nearly 
normally distributed even for small f. Since z and ,/w are independent, (—z +t) ./w) 
must therefore be more nearly normally distributed than ./w, whatever t,.* 


* This is practically obvious, but a demonstration is given in the Appendix, equation (43). 
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Hence an upper limit to the skewness of (—z+t,./w) is given for different f by the 
values of ,/2, in Table II. Comparison with Table I shows that it is better to take 
(—z+t,,/w) as normally distributed rather than t. The procedure is then as follows: 


Write mean,w=4; Gy» = (27) 


Then (—z-+t,./w) is taken to be normally distributed with mean at, and standard 
2; 
0 


2\4 
deviation (. + 7) . For given ¢, the value of é such that P(t > t)) equals € is given 
by seeking 6 such that the probability is ¢ of (26) being true. This gives 


2\% 
8(f to €) = aty~K,(1 (28) 
2f 
TABLE IT 
VB, of Vw for different f 
f 4 6 8 12 24 2 
VA, 0-41 0-32 0-27 0-21 0-15 0-00 


Conversely for given 6 the value ¢(f, 6, €) which will be exceeded with probability 
will be approximated by solving the equation 

On rationalization this becomes a quadratic in ¢. This method of approximation 
was given by W. J. Jennett & B. L. Welch (1939) together with a short table of 
a and b. However, since a and 6 differ from unity by a quantity of the order 1/f 
and since errors at least of this magnitude are involved in the assumption of 
normality, it would perhaps have been as logical to take a and 6b equal to unity 
in the approximation. The numerical comparisons set out in Table III indicate 
that nothing is lost by doing this and also show how inferior is the method of 
approximation which assumes ¢ itself to be normally distributed. 

Therefore the approximation which we shall adopt as being the best one, 
requiring only the use of the normal probability scale, will consist in assuming 
(—z+ty Jw) to be normally distributed about ¢) with standard deviation 


The value of d such that p(t > t,) equals € will then be approximated by 


é = at—K,(1+ | 


4 
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TABLE III 
Values of 8/./f 
f 
True Approx. 1 | Approx. 2 | Approx. 3 
4 0-99 2 0-056 0-736 — 0-088 —0-015 
4 0-01 2 3-972 12-023 3-848 4-015 
: 0-99 5 1-120 2-668 0-565 0-726 
4 0-01 5 9-247 29-230 8-840 9-274 
24 0-99 2 1-181 1-345 0-956 1-178 
24 0-01 2 2-819 3-163 2-800 2-822 
24 0-99 5 3-290 3-677 3-213 3-255 
24 0-01 5 $-752 7-593 6-687 6-745 


The values correspond to certain f, ¢ and ¢,/,/f and have the property that p(t>t, | f, 6) =e. 
Approximation 1 is the value obtained from equation (23), approximation 2 by using (28) with the 
correct a and 6 and approximation 3 by using (30), i.e. taking a = 6 = 1. 


Conversely, for given 4, the value ¢(f, 6, ¢) which will be exceeded with probability 
€ will be given implicitly by 


(31) 
which on rationalizing and solving becomes 
Uf, 8,6) = (32) 
(1-3) 


The approximations (30) and (32) are useful even in moderate-sized samples. 
It is not intended, however, to enter here into any detailed discussion of how good 
the various approximations are. They have been considered primarily because 
they form the basis of the exact tables which are given at the end of the paper and 
which are described in the next section. 


7. DESCRIPTION AND USE OF TABLES 

The tables are intended to facilitate the exact calculation of the quantities 
4(f, to, €) and t(f,d,€) defined in Section 2. The form in which they are presented 
was determined largely by the fact that the space they had to occupy was limited, 
and also to some extent by the method which was adopted to calculate them. 

Table IV may be used to give 4(f,t,,€) for 17 probability levels, viz. 

€ = 0-005, 0-01, 0-025, 0-05, 0-1, 0-2, 0-3, 0-4, 0-5, 0-6, 0-7, 

0-8, 0-9, 0-95, 0-975, 0-99 and 0-995. 
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The basis of the table is the large sample approximation described in the last 
section, equation (30). The K, in this equation is simply the normal multiple 
which is exceeded with probability ¢. In small samples an error will be committed 
in taking this form of approximation, and in Table IV we give instead a multiple 
A(f, to, €), which is such that exactly 


€) = te — AU fos +3) (33) 


Even for samples giving f as small as 4, it is found that A never differs much from 
K,.. This stability makes interpolation easy so that, for given ¢, it is possible to 
provide a table of small compass covering all values of t, from — 00 to 00. 

For given e the tabled quantity A is a function of f and f, and accordingly 
Table IV, for given e, is one of double entry. 

Values are given corresponding to f = 4, 5, 6, 7, 8, 9, 16, 36, 144 and oo. The 
reason for choosing these values of f is that A differs from K, by a quantity of the 
order 1/,/f. The values f = 9, 16, 36, 144 and coare such that 12/,/f = 4, 3,2, land0 
respectively. Hence interpolations for intermediate f’s may be simply effected by 
considering A as a function of 12/,/f, since we are then dealing with a function 
tabled at equal intervals. 

The choice of the values of t, in the table is also determined by the necessity 
for interpolation to be simple and yet the table not to be unduly large. The 
whole range of t, from — 00 to 00 has to be covered. This has been done as follows. 
For t,/./2f between — oo and — 0-75, A is given against the quantity 


14.8)" 34 
ime ( (34) 
For t,/,/2f between — 0-75 and 0-75, A is tabled against 


For to/,/2f between 0-75 and 00, A is again tabled against y. The argument interval 
for both y and y’ is 0-1. It will be noted that y =(1—y’). 

From the point of view of interpolation other, perhaps simpler, functions of 
t, could have been used in constructing the table. The choice of y was made 


because the quantity (1 + >) is wanted in any case for substitution into equation 


(33) after A has been obtained. y’ had to be used in addition because of inter: 
polation difficulties in the middle of the table. 

It will be noted that in Table [V only the double entry tables fore = 0-005, 0-01, 
0-025, 0-05, 0-1, 0-2, 0-3, 0-4 and 0-5 are given. For e > 0-5 we can use the fact that 


a relation which is apparent from the form of equation (1). 
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To summarize, the steps necessary to find the value of dé such that P(t >t») = € 
are as follows. 


(i) Find y = (1 + 3) 


t 
according as | —_| is greater than 


to 
pA 
or less than 0-75. 


(ii) If f>9 calculate 12/,/f. 


(iii) Ife is one of the values 0-005, ..., 0-5, enter the appropriate part of Table [V 
and obtain A(f,t, €), by interpolating with respect to the quantities obtained in 
(i) and (ii). 

2\4 

(iv) Calculate O(f, ty, €) = (1 + 3) 


(v) Ife is one of the values 0-6, 0-7, ..., 0-995 calculate 6(f, —t), 1 —¢) and then 
change its sign. 


Inverse use of Table IV 


Next consider the calculation of t(f, 4, €), ie. the situation where é is given and 
ty is required so that P(t>t,) =e. In the previous section it was shown that a 
first approximation will be given by equation (32). The true relation between 
t(f,6,e) and é will, however, be obtained by replacing K, in (32) by the A which is 
tabled in Table IV; thus 


The drawback about this equation is that A is given in Table IV as a function of 
to, and ty is not now known. An iteration method is therefore necessary. 
The successive steps in this iteration may be summarized as follows: 


(i) The first approximation ¢, is given by (32); thus 


(ii) Next use Table IV to calculate directly 
A, = A(f, ty, €)- 


(iii) A second approximation ¢, is then given by substituting this A, in (37); 
thus 


b+ A,(1+ 


= 


4 
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(iv) Then by finding A, = A(f,t,, €) and substituting in (37) a third approxima- 
tion t, will be obtained. These steps must be repeated until two successive 
approximations give the same value. In practice this is found to occur very 
quickly so that there is likely to be no need of more than three approximations. 


(v) Ife is one of the values 0-6, 0-7, ..., 0-995 calculate t(f, — 4, 1 —€) and then 
change its sign. 


Use of Table V 


The iteration process described above is not very lengthy in actual practice, 
but even this amount of trouble would be unnecessary if A were tabled as a function 
of 6 as well as a function of t. Such a table, similar in extent to Table IV, could of 
course be calculated, but it did not seem worth while to do this, since the inverse 
method of using Table IV is not difficult. However, it has been thought useful to 
give for the single probability level ¢ = 0-05, a table which can be entered with 
6, since very often this conventional 1 in 20 chance is taken as a point of reference 
in statistical problems. 

In Table V, A is tabled against 


= Jap (38) 


at intervals of 0-1 from — 1 to 1. When dis given, this function of é may be quickly 
calculated and then A may be obtained by interpolating in Table V. The substitu- 
tion of this value of A in (37) gives the required ¢(f, 5, 0-05). As before ¢(f, 6, 0-95) 
may also be calculated from the same table, being equal to minus t(f, — 6, 0-05). 


8. EXAMPLES 


In the present section some numerical examples will be worked to illustrate 
the application of Tables IV and V. Reference will be made to theoretical results 
obtained in Sections 3-5. 

Example 1. Inasample of xn = 25a coefficient of variation v = 2-6 is observed. 
Obtain from this a “median” estimate of the population coefficient V. By a 
“median” estimate is meant one for which the chance of it exceeding the true 
V is 0-50. In other words our estimate is the 50 % fiducial limit. Hence from (8) 
the required estimate is 


where to= = = 1-9231 and f = 24. 
We have (1 +#) 1-0379; (1 +8)". 0-2674 


12/Jf = 2-4495. 


| 
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From Table IV, for ¢ = 0-5, we obtain by linear interpolation* with respect to 
y’ and 12/,/f, A = 0-0197. Hence from (33) 


& = 1-9231 — (0-0197) (1-0379) = 1-9027. 


Therefore from (39), v,, = 2-628. 

Example 2. A sample of n = 10 measurements of a normally distributed 
character 2 is given. The mean Z is 4-7 and the standard deviation s is 0-2. Obtain 
10 % fiducial limits for the proportion P in the population exceeding x = 5-0. 

As is shown in Section 5 this is equivalent to finding limits for U = (5-0—£)/c. 
We calculate first wu = (5-0 —%)/s = 1-5. Substituting this into (16) the upper and 
lower 10 % points for U are given by 4(9, 4-743, 0-9)/,/10 and 6(9, 4-743, 0-1)/,/10. 


We have 148) 21-5, 0-6067 


From Table IV for ¢ = 0-1 we obtain by linear interpolation* 


4:743, 0-1) = 1-347, 
and hence from (33) 


8(9, 4-743, 0-1) = 4-743 — (1-347) (0-6667) = 2-723. 


To obtain 6(9, 4-743, 0-9) we note by (36) that this is equivalent to minus 
6(9, — 4-743, 0-1). The corresponding A is obtained by entering the same table as 
before, with the same value of y, but in the part of the table for which ¢, is negative. 


We obtain A(9, — 4-743, 0-1) = 1-197. 
Hence 8(9, — 4-743, 0-1) = — 4-743 — (1-197) (0-6667) = — 6-538, 


and therefore (9, 4-743, 0-9) = 6-538. 
The upper and lower 10 % limits for U are therefore 6-538/,/10 and 2-723/,/10, 
i.e. 2-068 and 0-861. The corresponding limits for P are 0-019 and 0-195. 
Example 3. Suppose it is required to estimate fiducial limits for the value L 
of x in the normal population which is such that the proportion of the population 
exceeding L is 10%. Consider the case where the sample size is n = 10 and the 
limits required are the 5 °, limits at each end. From (18), we require 


E+kooss and 


_ /10 U, 


where k, (40) 


and U is the normal deviate exceeded with probability 0-1; i.e. U = 1-2816. We 
therefore require ¢(9, 4-053, 0-05) and #(9, 4-053, 0-95). In the first place these will 
be derived from Table IV by the inverse method described above. 


* For a note on the accuracy of linear interpolation see the end of Section 9. 
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Following out the successive steps to derive t(9, 4-053, 0-05) we obtain 
4-053 + (1-6449) (1 + 0-9126 —0-1503)! 


= 7-34 
4 (1 — 01503) 
(ii) y, = +3) = 0-5004; A, = A(9, 7-340, 0-05) = 1-6803, 
, _ 4053+ (1-6803) (1 +0-9126—0-1569)! 
(ili) t, = (1 — 01569) = 7-448, 


(iv) = +3 =0-4950; A,=A(9, 7-448, 0-05) = 1-6800, 
ts= 7-447. 


To obtain £(9, 4-053, 0-95) we require first t(9, — 4-053, 0-05). The successive 
approximations to this quantity are 


— 4-053 + (1-6449) (140-9126 —0-1503)t 2-200 
(1 —0-1503) 


(i) i= 


-} 
(ii) = —0-4602; A,=A(9, — 2-200, 0-05) = 1-5933, 
(iti) t, = — 2-249, 


(iv) y = += = —0-4684; A,=A(9, — 2-249, 0-05) = 1-5925, 
tz = — 2-250; 


t(9, 4-053, 0-95) is equal to minus t(9, — 4-053, 0-05) and therefore equals 2-250. 
Hence from (40) 


7-447 2-250 
ig = 2355; 


= 0-712. 

The above calculations have been performed using Table [V and the inverse 
method described in the previous section. Actually this example could have been 
treated much more simply by Table V. We shall proceed to show this, but it 
must be remembered that Table V only covers cases where € = 0-05 and € = 0-95. 
For other probability levels Table IV will have to be used. 

To obtain t(9, 4-053, 0-05) and ¢(9, — 4-053, 0-05) Table V has to be entered with 


a= Ne) (2 + 7) , where é has the two values + 4-053, i.e. has to be entered with 


n' = + 0-6908. These give values of A equal to 1-6800 and 1-5925 and are the same 
as the final A’s obtained in the iterations. We have then only to perform the 
calculations for t by substituting in (37). 
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9. SUMMARY 


An account of some of the applications of the non-central t-distribution* has 
been given. Tables of the distribution have been provided together with numerical 
examples of their use. 

Ift, and é are connected by the equation P(t >t, | 6) = e, then it was found that 
a good approximation to 4, given to, is 


\t 
-K,(1+33) 


where K, is the normal deviate exceeded with probability e. The equivalent 
approximation 

62 K2\t 
6+K, At + 3) 


is good for calculating t, given 6. These approximations were seen to have ad- 
vantages over other approximations also based only on the normal distribution. 
Tables IV and V at the end of the paper make possible a more exact determina- 


tion of 6 given ty and ty given 4, respectively. In these tables a quantity A, is given 
such that 


2f 
2 
and so to= f 


(1-3) 


Note on the accuracy of the tables. In the greater part of the tables A, is 
correct to as many figures as are shown. Occasionally, however, the values of 
A, may be almost 2 units wrong in the last figure. It was nevertheless con- 
sidered worth while to give all the figures shown. 

Throughout Table IV linear interpolation will always give a result not more 
than a } unit wrong in the second last figure. This is also true of Table V, 
except between 7’ = —0-8 and —1-0 for f= 9 and 16. Here linear interpolation 
with respect to 7’ may be 1 unit wrong in the second last figure. 


* t=(z+8)/Vw, where z is a unit normal deviate, w is distributed independently as x?/f, and f 
is the number of degrees of freedom of x?. 
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386 Applications of the non-central t-distribution 


TABLE V 
Values of A as a function of 6 
«=0-05 
y f 4 5 6 7 8 9 16 36 144 oa) 
1-0 | 1-636 | 1-648 | 1-655 | 1-660 | 1-662 | 1-6638 | 1-6665 | 1-6634 | 1-6559 | 1-6449 
0-9 | 1-643 | 1-655 | 1-662 | 1-666 | 1-668 | 1-6695 | 1-6711 | 1-6667 | 1-6576 | 1-6449 
0-8 | 1-650 | 1-662 | 1-668 | 1-672 | 1-674 | 1-6747 | 1-6751 | 1-6691 | 1-6586 | 1-6449 
0-7 | 1-657 | 1-668 | 1-674 | 1-677 | 1-679 | 1-6796 | 1-6782 | 1-6707 | 1-6589 | 1-6449 
0-6 | 1-664 | 1-675 | 1-680 | 1-682 | 1-684 | 1-6838 | 1-6804 | 1-6714 | 1-6587 | 1-6449 
0-5 | 1-671 | 1-681 | 1-686 | 1-687 | 1-687 | 1-6871 | 1-6817 | 1-6709 | 1-6580 | 1-6449 
0-4 | 1-679 | 1-687 | 1-690 | 1-691 | 1-691 | 1-6896 | 1-6816 | 1-6698 | 1-6568 | 1-6449 
0-3 | 1-687 | 1-693 | 1-694 | 1-693 | 1-692 | 1-6902 | 1-6804 | 1-6677 | 1-6550 | 1-6449 
0-2 | 1-693 | 1-697 | 1-696 | 1-694 | 1-692 | 1-6898 | 1-6779 | 1-6646 | 1-6529 | 1-6449 
0-1 | 1-698 | 1-699 | 1-697 | 1-693 | 1-690 | 1-6874 | 1-6738 | 1-6606 | 1-6504 | 1-6449 
0-0 | 1-703 | 1-699 | 1-695 | 1-690 | 1-686 | 1-6827 | 1-6682 | 1-6558 | 1-6477 | 1-6449 
—O-1 | 1-702 | 1-695 | 1-689 | 1-684 | 1-679 | 1-6756 | 1-6611 | 1-6503 | 1-6447 | 1-6449 
—-2 | 1-698 | 1-688 | 1-680 | 1-674 | 1-669 | 1-6657 | 1-6525 | 1-6442 | 1-6417 | 1-6449 
—0-3 | 1-687 | 1-676 | 1-667 | 1-661 | 1-657 | 1-6535 | 1-6427 | 1-6378 | 1-6388 | 1-6449 
—0-4 | 1-670 | 1-658 | 1-650 | 1-645 | 1-642 | 1-6391 | 1-6322 | 1-6314 | 1-6359 | 1-6449 
—O0-5 | 1-646 | 1-636 | 1-630 | 1-627 | 1-624 | 1-6231 | 1-6213 | 1-6252 | 1-6334 | 1-6449 
—0-6 | 1-615 | 1-610 | 1-607 | 1-606 | 1-606 | 1-6066 | 1-6108 | 1-6195 | 1-6313 | 1-6449 
—0-7 | 1-582 | 1-583 | 1-585 | 1-587 | 1-589 | 1-5911 | 1-6019 | 1-6150 | 1-6299 | 1-6449 
—O-8 | 1-551 | 1-559 | 1-565 | 1-571 | 1-575 | 1-5792 | 1-5954 | 1-6122 | 1-6291 | 1-6449 
—0-9 | 1-531 | 1-544 | 1-553 | 1-561 | 1-567 | 1-5722 | 1-5925 | 1-6116 | 1-6292 | 1-6449 
—1-0 | 1-528 | 1-543 | 1-554 | 1-563 | 1-569 | 1-5744 | 1-5952 | 1-6141 | 1-6307 | 1-6449 


For note on accuracy of table see end of Section 9. 


APPENDIX 
On the calculation of Table IV 


Before deciding on the method to be employed in calculating the tables a wide variety of 
methods were considered. We examined these with a view to finding one which would 
best combine the attributes of speediness, lack of opportunity for numerical errors, and 
adaptability to moderately large-scale work. In this appendix will be described the more 
promising of the methods tested, including the one finally adopted. 

The non-central ¢ being defined as in (1) we have for the probability that ¢>¢), with the 
notation of Section 2, 


vi 


We want to find pairs of values 6 and t, such that P(/,4,¢)) has specified values ¢€; i.e. we 
want to evaluate the quantities ¢,(/,4,¢) and d(f, ty, 6). 
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(i) Direct Methods 
These are based on the idea of evaluating the right-hand side of (41) directly for various 
values of 6, f, and ¢, and then obtaining the quantities required by a process of inverse 


interpolation. An obvious way to evaluate P(f, 6, ¢)) is by quadrature. For small even valies 
of f, however, the following formulae: 


P(0<t<o|f,d) = du, 
0 


P(0<t<t|f,8) = — e)-4 


ao 
where Hh, (x) = us dy 
0 
were found to be more convenient to use. The Hh, functions have been tabled by J. R. Airey 
(1931). 


This modified procedure was still unduly lengthy and not well adapted to large scale 
work. It provides, however, a useful check on values obtained by other methods. 


(ii) Solution by means of a differential equation 
For fixed values of f and € the equation P(f,é,t)) = € may be regarded as specifying é and 
t, as implicit functions of each other. This implies that for constant ¢, 
(f, 6, to) | 
dé at, 
From (41) this is found to give 


dt 


(42) 
_ Ahy_(2) 
where J;(x) = 
It may be shown that Ax) (43) 


For given values of f and ¢, the value of ¢, corresponding to d = 0 can be found in a straight - 
forward manner since it is simply a probability level of Student’s ¢. Starting from this 
initial value we can solve the differential equation (42) numerically and obtain the values 
of *, corresponding to non-zero values of 6. A trial of this method was made, taking f = 6, ; 1 
and ¢ = 0-05. The differential equation was solved by the Adams-Bashforth process. The 
values of higher derivatives of ¢ with respect to é at the point d = 0 are required in this 


process. By means of (43) the formulae for these quantities were obtained in quite a simple 
form. 


This method gave promising results—it was suitable for large-scale work and there was 
comparatively little chance of inaccuracies entering the work—but it suffered from the 
defect that we could proceed to build up our table by small inerements only of é (actually 
increments of 0-05). As we wanted to be able to deal with large as well as small values of 6 
we decided that the process was not sufficiently speedy for our purposes. 

It may be remarked, however, that a similar type of approach might be quite suitable in 
cases where the expression for the differential coefficient is simpler than was ours. 
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(iii) Solution based on normal approximation 


The method finally adopted for calculating the greater part of the table is based on the 
use of Edgeworth’s development of the normal probability function. In a recent paper 
E. A. Cornish & R. A. Fisher (1937) have given formulae, derived from this development, 
which make it possible to find percentage levels of a probability function when the cumulants 
of the function are given. These formulae are of most practical use when the probability 
function is itself nearly normal in form. With markedly non-normal distributions, the 
development may not converge, or at least more terms may be required than is practically 
convenient. 

The cumulants of the non-central ¢-distribution are not difficult to find. The first three are 
given in equations (19), (20) and (21) and the values of ,/f, for different f and é are given in 
Table I. It was pointed out in the discussion of Section 6 that the non-central ¢-distribution 
would often be markedly skew. It would appear, therefore, that it may not be a suitable 
distribution to express by a development of the normal function. Fortunately it is possible, 
as is pointed out in equation (26), to expr-ss problems demanding the quantities ¢(f, 4, €) and 
5(f,t ,€) in a form such that the solution can be based on a distribution much more nearly 
normal than that of non-central ¢. 


It was shown that the statement 


P(t>t,|f,8) =e 
was equivalent to the statement 


P(Y <8|f,&) 
where Y= 


(Here z is a unit normal deviate and y is distributed in the y-distribution with f degrees of 


freedom.) The problem of finding d(f,to,€) is therefore equivalent to finding a percentage 
level of Y. 


The cumulants of Y may be obtained immediately from the formulae 


In particular for r= 2, = 
while for r$2,  «(¥) = «,(X). 


Hence for all r the shape coefficients y,( Y) increase from 0 to y,() as ty increases from 0 to co. 
The y’s of Y are therefore always smaller than those of x. But these latter are quite small even 
for small f. Hence the distribution of Y can never be far removed from normality. The distri- 
bution of Y is therefore suitable for development by Edgeworth’s method and Cornish & 
Fisher’s formulae may conveniently be used to provide percentage levels. 

In this approach the problem of finding 4(f, t),€) appears as more fundamental than that 
of finding ¢(f, 6, €). Hence Table IV is designed to facilitate the direct calculation of 8(f, to, €). 
Its use to provide é(f,6,¢€) demands a process of iteration which is not however difficult. The 
actual quantity A(f,t),¢) which is tabled differs by a small amount from K,, the normal 
deviate which is exceeded with probability e. This difference is given by the Cornish-Fisher 
formula. All the terms in this formula were used. This necessitated the use of the first six 


cumulants of y. A method of obtaining these has already been described in a previous note 
(N. L. Johnson & B. L. Welch, 1939). 
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The greater part of the computation of Tables [IV and V has been the work of one of us 
(N. L. Johnson). The remainder and much subsidiary checking work is due to Miss Catherine 
M. Thompson. We gratefully acknowledge her assistance. 
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MISCELLANEA 
(i) Stirling's formula with remainder* 
By E. V. HUNTINGTON, Harvard University 


We assume the following theorem as known: 


n!= 


where n = 2, 3, 4, ..., and 
1 1 1 1 691 
= 360n** 1260n®  1680n? * 1188n®  360360n" 
= 0-083, 333n-1 — 0-002,777n-2 + 0-000,794n-5 — 0-000,595n-? 
+ 0-000,842n-* — 0-001,918n-" +... 


We assume also that the error involved ’in stopping the series at any point is less (in 


absolute value) than the first term dropped. (For proof, see, for example, E. B. Wilson, 
Advanced Calculus, p. 456.) 


Putting Q = e?, and expanding and collecting terms, we find: 
nt = y(2m) 
12n 288n? 51,840n* 2,488,320n* 
163,879 5,246,819 534,703,531 
209,018,880n5 +7 5,246,796,800n® 902,961,561,600n? 
4,483,131,259 432,261,921,612,371 
~ 86,684,309,913,600n5 514,904,800,886,784,000n° 
= 1+ 0-083,333,333n—! + 0-003,472,222n-* 
— 0-002,681,327n-3 — 0-000,229,472n-4 
+ 0-000,784,039n-5 + 0-000,069,728n-® — 0-000,592,166n-? 
— 0-000,051,718n-§ + 0-000,839,499n-* + .... 


(For the terms up to and including n-?, compare, for example, H. T. Davis, Tables, 1, 180.) 
Let Q,= the sum of the Q-series up to and including the term in n-*, Q,;= the sum of the 
Q-series up to and including the term in n-', ete. 
Then we find: 


Q=1+R,, where 0-083,33n-!<R,< 0-085,07n-". 
Q=Q,+R, where 0:002,07n-*<R,< 0-003,48n-. 
Q = Q,4+R;, where —0-002,80n-*< R,< —0-002,59n-5. 
Q=Q;+ Ry, where —0-000,238n-*<Ry< 0-000,18n-*. 
Q=Q,+Rs, where 0-000,66n-5<R,< 0-000,82n-5. 
Q=Q,;+R,, where —0-000,24n-*<R,< 0:000,07n-*. 
Q=Q,+R,, where —0-000,62n-? < R, < —0-000,39n-". 
Q=Q,+R,, where  0-000,39n-%. 
Q=Q,+R,, where —0-000,01In-*< R,< 0-000,88n-*. 
* Presented to the American Mathematical Society, 8 April 1939. 
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(ii) A note on the interpretation of quasi-sufficiency 


By M. 8. BARTLETT 


1. It has been shown by Fisher that if in problems of location we confine our attention 
to samples with the same configuration C, where C denotes the differences between the 
observations, then no statistical information in his sense is lost whatever reasonable statistic 
T we choose as an estimate of the unknown parameter 6. This result follows for probability 
laws of the type under consideration, 


P(x | 0)=f(x—) dx, 
from the relation PS | 0) = p(T | C, A) p(C), 


where S denotes the sample of observations. For the conditional statistic T | C—a random 
or statistical variable 7’ varying for given or fixed C—I have when a relation like (1) exists 
used the term quasi-sufficient, in view of the correspondence of (1) with the relation for a 
sufficient statistic 7’, 

(2) 


Conditional statistics also occur in the theory of more than one unknown, when the fixing 
of some statistical variables has the effect of eliminating unwanted parameters from our 
distributions. Confining our attention, however, to problems which are primarily problems 
in one unknown only, we require to examine relations of the type (1) further, in view of some 
recent comments by Welch* on the extent to which any conditional statistic like T’'| C can 
claim to be sufficient. . 

From Fisher’s theory of information, it follows that when estimating @ from two or more 
samples, we should naturally weight the different samples according to the information 
J(C) in each. But Welch has pointed out that if we are concerned with interval estimation 
of 6 from a single sample, or with the most efficient statistical test associated with a value 0, 
it does not appear that all our information, in the widest sense, is retained in T'| C. If, for 
example, the only alternative to 0, happens to be 4,, it is known that the appropriate criterion 
for discriminating 9) and 0, is p(S' | 6,)/p(S | 6)). While from equation (1) it follows that this 
criterion is equivalent to p(T'| C,0,)/p(T | C, @,), the former criterion must be referred to the 
distribution of S, not of S| C. 

In partial answer to this criticism, it might be noticed that when considering S | C, we are 
not bound to choose the same significance level « for each C observed. Suppose we choose 
a(C,) = e€ for the first sample. If the second sample gave a different configuration C,, we 
might take a(C,) such that the power of the test was a maximum for the two configurations 
C, and C, if we had chosen @(C) such that }{a(C,) +a(C,)} = e. Similarly for C3, and so on. 
In the long run, the significance level adopted on this rule would be the average value 
E,{a,(C,)}, where «,(C,) denotes the significance level taken for the rth sample when making 
the rth test. For any finite number of samples the average significance level adopted is not 
exactly equal to ¢, but the level becomes equal to ¢ in the limit; thus we could argue that, 
theoretically at least, we should eventually reach the most powerful test for a given signi- 
ficance level merely from consideration of S| C. In the above argument we allow the con- 
figuration distribution to be generated by the samples themselves. It is of course theoretical 
quibbling, for if we may assume that C has a definite distribution p(C) which is already 
known, we should use this fact rather than wait for its confirmation by repeated sampling. 

My own answer to the query of how far 7'| C can be considered sufficient is that it is only 
sufficient provided that we do really agree to consider samples with the same configuration 
as that observed. The rejection altogether of the factor p(C) has two consequences, (i) we 
have seen that a test might conceivably be derived on the basis of p(C) more powerful than 


* B. L. Welch. Annals Math. Statist. 10 (1939), 58-69. 
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the test based on T'| C with fixed significance level a(C) = ¢, (iii) the fixing of C implies that 
whatever selection for C there might be distorting p(C), the validity of tests based on T'| C 
will be unaffected. 

The independence of the test from selection for C is the explicit advantage gained, 
though in addition it is possible that if our specification of the population is erroneous, that 
it is less so for p(T’ | C) than p(S); for example, if the configuration C is similar to the expected 
configuration on normal theory, it may prove more feasible to assume for a certain range of 
populations differing from normality that normal theory may still be approximately sub- 
stituted for the unknown probability p(7'| C) than that we may do so for p(S) or p(T’), 
though I have not investigated this point. 


2. It is instructive to remember Fisher’s comparison of the quasi-sufficient statistic 
T | C with the sufficient statistic 7’, where T’ may be written in full, 7'| n, where n is the size 
of the sample. The equivalence of the two types of statistic when we regard C as fixed can 
be extended, in terms of our theoretical argument, if conversely we assume that n had in 
some problem a definite and known distribution from sample to sample. Consider similarly 
the test of significance of a regression coefficient. The orthodox theory is to consider the 
conditional statistic b | 2(a—)*, where b is our estimate, and X(a—)* the sum of squares of 
deviations of the independent variable x. Suppose for the sake of argument that the true 
variance of the residual dependent variable y, was known to be unity, and the z’s are such 
that 2(2—Z)?= 1 on Mondays and 1-44 on Tuesdays. Then for an 0-025 significance level 
(one tail), the usual practice would be to take 1-96 as the significance level for b (from b, = 0) 
on Mondays, and 1-96/1-2 = 1-633 on Tuesdays. The power of the test in relation to the 
alternative that 6, = 3-92 is 0-9860. But if we were satisfied with adjusting the significance 
level to be 0-025 merely in the long run for Mondays and Tuesdays together, we may raise 
the power of the test to its maximum value of 0-9878, by taking the Monday significance level 
at 6 = 1-87 («=0-0307) and the Tuesday level at 6 = 1-723 («=0-0194). 


SuMMARY 


In a, theoretical note on the interpretation of quasi-sufficient statistics in problems of one 
unknown parameter, it is agreed with Welch that full sufficiency properties can only be 
claimed if we deliberately confine our attention to conditional samples of the type observed; 


but it is pointed out that the resulting inferences are independent of any selection operating 
on the variables that have been fixed. 


(iii) The cumulants and moments of the binomial distribution, and the 
cumulants of x? for a (m x 2)-fold table 


By J. B. 8S. HALDANE, F.R.S. 


From the Department of Biometry, University College, London 


The first four cumulants of the distribution of y? for a (n x 2)-fold table when samples are 
finite, have already been given (Haldane, 1937). These and higher cumulants and moments 
can be calculated by a simpler method. Consider a sample of s, the probability of a success 
being p, and p+q = 1. Pearson(1919) pointed out that for moments of (p + q)* about its mean, 
the generating function is (qe? + pe-")’, and Romanovsky (1923) gave a recurrence formula 
for the moments. That for the cumulants is much simpler. 


Let U = ge” + 
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Then the cumulant-generating function 
x 
K(t)= X —=slogv. 
r=2 T ! 


To find the cumulants for s = 1, we note that 


pale”! — 
U 


So a KO = pas, KO) 
dg 


Equating the coefficients of é’/r!, we find 


Ke = Pq, 
and if r>2 = (1) 
Let pg = c, p—q = g; then 
dx, dc dg 
dq? dg 
Hence if f(c), 
Korea = gef'(c), \ (2) 
— 6c) f"(c) + — 4c) f"(c). 


From these equations we can very rapidly calculate successive values of «,, since K,= ¢, 
and find: 

K,= 0, 
¢, 
Cg, 
Ky= c— 6c?, 
Ks= g(c— 12c*), 
Kg = c—30c? + 120c8, 
g(c— 60c* + 360c'), 
Kg= C— 126c? + 1,680c* — 5,040c*, 
Ky = g(c — 252c? + 5,040c3 — 20,160c*), 

Kip = 510c? + 17,640c? — 151,200c* + 362,880c5, 

Ky, = g(c— 1,020c? + 52,920c* — 604,800c* + 1,814,400c), 

Kig= C— 2,046c® + 168,960c3 — 3,160,080c* + 19,958,400c5 — 39,916,800c*. ...... (3) 


If each of the above cumulants be multiplied by s, the moments about the mean can now 
be calculated from the expressions given by Fisher (1928) and Haldane (1938). If p = $ we 


have K(t) = slogcosh }, 
8 8 8 17s 31s 691s 


while if g is very small we have for the cumulant-generating function of a Poisson series 
K(t) = se(e'— 1-2). 


— K(t)= 
ot 
a 
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The coefficient of c? is —[s" + (— 1)’ — 3]. So when qis small, but its square is not neglected, 

the first order correction to the Poisson cumulant-generating function is 
K(t) = sq(e'— 1 —t) + sq*(e** — e*). 

The numerical coefficient of the highest power of c in x, is (r—1)! when r is even, and 
4(r—1)! when r is odd. 

Consider a sample of s, in which @ successes are recorded. Then 
_ (a—sp)? 

spq 


But a — sp is the departure from the mean of the binomial distribution (p + qg)*. Hence the 
rth moment of the distribution of x? (for one degree of freedom) about zero, is 


Mer 
where is the 2rth moment of (p+q)*. 
But if 4; and x; be the rth moment about the mean, and the rth cnmulant, of the x? 
distribution, then 
etc., Ki = My, ete. 
Making the nécessary substitutions, we find, for the cumulants of x? in terms of those of 
the binomial distribution: 


K, = K,, 
Ky = (sc)-*(2K2 
Ks = 8x2 + 2(5x5 + 6x, Ky) + Kg), 
Ki = (sc)~*[48K} + + Kq) + 8(4K2 + Ks + 3K 
KS, = (sc)-5[384«3 + 960( 5x3 K3 + Ky) + 80(25K3 Ky + 16K, Ki + 28K, 
+ 6x2 Kg) + 2(63K2 + 100K, Ky + + Ks) + Kio], 
Ki, = + 9,600(10«3 x3 + 3x3 Ky) + 4,800( + 25K, Ky 
+ 8K3 + 14K3 Ky + 2K3 Kg) + 40( + 672K, Ky K; + 189K, K2 
+ 226K} Kg + 300K, Ky Kg + 180K, Ky Kz + Kg) + 4(113K5 + 198K, K, 
We now substitute the values of x, given in equations (3) multiplied by s, putting 
k = (pq)*= ec". 
We therefore have, for the cumulants of x? with one degree of freedom: 
K,=1, 
Kg= 2+(k—6)s-', 
ky= 8+ 2(11k—56) + (k? — 30k + 120) s-2, 
K,= 48+ 96(4k — 19) s-! + 16(7k? — 125k + 420) s-? + (k3 — 125k? + 1,680k — 5,040)s-3, 
K,= 384+ 960(7k — 32) + 400( 15k? — 214k + 648) + 6(81k5 
— 3,908k? + 38,420k — 98,496) + (k4 — 510k8 + 17,640k? 
— 151,200k + 362,880) s-4, 
Kg= 3,840 + 9,600(13k — 58) s—! + 9,600( 26k? — 327k + 924) s—* 
+ 40(1,729k? — 56,236k? + 459,024k — 1,065,792) + 4(50144 
— 59,398k5 + 1,289,244k? — 8,824,320k + 18,555,840) s—4 
+ (k5 — 2,046k4 + 168,960k3 — 3,160,080k? + 19,958,400k 
— 39,916,800)s-§. ...... (5) 
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When p = }, k = 4, and we have, for n degrees of freedom: 

2ns—(s— 1), 

8ns—*(s— 1) (s— 2), 

K,= 16ns—*(s — 1) (3s? — 15s + 17), 

128ns—*(s — 1) (s — 2) (38? 21s-+ 31). 

Kg= 256ns—*(s — 1) (15s*— 210s + 990s? — 1,950s + 1,382). 
If there are n samples, with different values of s, we have, for the cumulants of x*, where 


k= , and R; = 
2pq 


2n{1+(h—3) Ry), 
k y= 4n[2+4(11h— 28) R, + (h?— 15h +30) 
8n[6+ 12(8h— 19) R, + 4(14h? — 125h + 210) Ry + — 63h? + 420h — 630) Ry], 
ks = 16n[24+120(7h — 16) R, + 100(15h?— 107h + 162) Ry + 3(S1A° 

— 1,954h? + 9,560h — 12,312) Ry + (h4— 255h3 + 4,410h?— 18,900h + 22,680) Ry], 
32n{120 + 600(13h — 29) R, + 600(52h? — 327h + 462) Ry + 10(1,729h3 

— 28,118h? + 114,756h — 133,228) R, + 2(501h* — 29,699h3 + 322,311h2 

— 1,103,040h + 1,159,740) Ry + 1,023h4 + 42,240h3 — 395,010h2 


+ 1,247,400h — 1,247,400) ...... (7) 
When p = q = }, we have: 

2(n—R,), 


8(n—3R, +4R,), 

k,= 16(3n —18R, + 32R, —17Rs), 

ks= 128(3n —30R, + 100R, — 135R, + 62R,), 

256(15n — 225R, + 1,200R, — 2,940R, + 3,332R,—1,382R,). ...... (8) 


The first four of equations (5, 6, 7, 8) have already been given in a slightly different form 
by Haldane (1937). The limiting forms of equations (5) and (7) when s tends to infinity and 
k to zero, while ks = g, have been given by Haldane (1938). However, the expression for x, 
there given is incorrect. The coefficient of R, in the expression for x, should be 124,800. 

The extension of equations (7) would be rather tedious. However, those of equations (6) 
and (8) would not be very difficult. The coefficient of z*"/2r! in the expansion of log cosh ¢ is 
the value of (d/dx)**-(1—tanh? x) when x = 0, and can easily be calculated, since this 
differential coefficient is a polynomial in tanhz. The equations for moments in terms of 
cumulants can easily be extended when all odd cumulants vanish. In this case a useful check 
can be obtained from the fact that when s = 2 the cumulant-generating function of x? 
is t+ logcosht. 


SUMMARY. 


Expressions are obtained for the first twelve cumulants of the binomial distribution, and 
a simple recurrence formula for further cumulants. The first six cumulants of x* for a 
(n x 2)-fold table when expectations are small, are deduced. 
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CAUSATION AND CORRELATION 


(iv) The principles of the mathematical theory of correlation. By A. A. 
TscHuprow. Translated by M. Kanrorowrtscu. Wm. Hodge & Co. Ltd. Price 
12s. 6d. 


This book is a translation of an enlarged reproduction of lectures delivered by Professor 
Tschuprow at the University of Oslo, and originally printed in German about a decade 
ago. It will be of interest to many people in that it is a complete survey of correlation 
theory and its underlying principles. The theory is expounded along the now familiar 
classical lines which had their origin in Karl Pearson’s writings, and although the appli- 
cation of this theory to practical problems is perhaps a little out of date, it still forms a 
necessary background which the student must acquire, and of which a proper understanding 
will always be essential. 

The book as a whole shows an astonishing ‘‘patchiness”’ in writing. It may be the 
fault of the translator, but the fact remains that the meaning of whole paragraphs is 
sometimes very obscure, while at other times the ease and lucidity with which arguments 
are presented are unrivalled by any comparable treatise. This unevenness is unfortunate, 
since the book will be read more for the exposition of underlying principles than for its 
algebraic development of the theory; indeed it may be questioned whether those who 
have not previously mastered the elements of statistical theory and probability will obtain 
any profit from its reading. There is no clear discussion of probability and what it means, 
and the reader is left to find out from examples how a probability may be calculated. 
This cannot be deemed a fault, but seems to point to the book being useful only if previous 
knowledge of the subject is obtained elsewhere. 

The chapter on stochastic connexion and functional relationship is good, and may be 
read with advantage by any statistical worker and teacher. The method of approach is 
the same as that of the present day, and will doubtless be followed for many years to 
come. It is, however, a little astonishing to find that the translator makes no use of the 
term ‘‘random variable’’, which has long passed into common use. The general discussion 
throughout the book and in particular in the chapter entitled ‘‘Object and value of Corre- 
lation Measurement” is stimulating, even if the reader is not always in agreement with the 
author. The mathematical development of the theory is set out with clarity and freshness, 
which make it enjoyable reading. The notation is a little cumbersome, but, once mastered, 
it does not prove difficult to follow. 

Taken as a whole, this book is a worthy contribution to correlation literature, and it is 
surprising that no translation has been published until this date. It certainly should be 
read by all who attempt to gain an understanding of statistical theory. 

F. N. DAVID. 
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